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PLANT GROUP 2 PROMOTERS AND USES THEREOF 

BACKGROUND OF THE INVENTION 
This invention relates to compositions and methods 
useful in the production of transgenic plants. In particular, 
the invention relates to Group 2 (Gp2) plant promoter 
sequences and to expression cassettes containing Gp2 plant 
promoter sequences. The invention also relates to vectors and 
transgenic plants containing Gp2 plant promoter sequences that 
are operably linked to heterologous DNA sequences. In 
addition, the invention relates to methods of producing 
transgenic plants by using vectors containing Gp2 promoter 
sequences . 

Isolated plant promoters are useful in the genetic 
engineering of plants to produce transgenic plants with 
desired phenotypic characteristics. In order to produce such 
transgenic plants, an isolated plant promoter is inserted into 
a vector and operably linked to a heterologous DNA sequence. 
Plant cells can then be transformed in a variety of ways by 
DNA constructs containing an isolated plant promoter fused to 
heterologous DNA sequences. The result of this transformation 
is that the plant promoter operably linked to the heterologous 
DNA is inserted into the genome of the transformed plant cell. 
Furthermore, the regulation of the heterologous DNA in the 
transformed plant cell can be controlled by the expression of 
the promoter. 

There are a variety of different approaches for 
producing a desired phenotype in a transgenic plant, depending 
on the nature of the heterologous sequences coupled to the 
isolated plant promoter. For example, expression of a novel 
gene that is not normally expressed in plant or in a 
particular tissue of a plant may confer a phenotypic change. 
Alternatively, the expression of a sense or an anti-sense 
construct introduced into transgenic plants can cause the 
inhibition of expression of endogenous plant genes. This 
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inhibition of expression can, in turn, produce the desired 
phenotypic change « 

There is a need for a variety of different promoters 
to be used in the genetic engineering of plants. These 
promoters are of several types • Constitutive promoters are 
one such commonly used type of promoter. Constitutive 
promoters are those which are capable of expressing operably 
linked DNA sequences in all tissues of a plant throughout 
normal development. In contrast to constitutive promoters, 
tissue-specific promoters are those promoters that are capable 
of selectively expressing heterologous DMA seG[uences in 
certain plant tissues. Tissue-specific promoters may also be 
inducible, e.g., by application of external inducing agents. 
Constitutive and tissue-specific promoters are both used in 
15 the genetic engineering of plants, and have value for many 
different potential applications in this field. 

Constitutive plant promoters may be obtained by 
isolating the regulatory region of a plant gene that is 
constitutively expressed. In addition to promoters obtained 
from plant genes, there are also promoters of bacterial and 
viral origin which have been used to constitutively express 
novel sequences in plant tissues. Examples of such promoters 
from bacteria include the octopine synthase (ocs) promoter, 
the nopaline synthase (nos) promoter and others derived from 
25 native Ti plasmids (see Herrara-Estrella et al . (1983) Nature 
303:209-213). The 35S and 19S RNA promoters of cauliflower 
mosaic virus are commonly used examples of viral promoters 
(see Odel et ai . (1985) Nature 313:810-812). 

In contrast to constitutive promoters, tissue- 
30 specific promoters are generally isolated from the promoter 
regions of plant genes that are selectively expressed in a 
specific plant tissue. These promoters can be fused with a 
heterologous DNA sequence and used to transform a plant cell 
to create transgenic plants that selectively express the 
35 heterologous DNA in a specific tissue. For example, the 

promoter regions from the fruit-specific, ethylene regulated 
genes E4 and E8 and from the fruit-specific polygalacturonase 
gene have been used to direct fruit specific expression of a 
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heterologous DNA sequence in transgenic tomato plants. (See 
Cordes et al . , Plant C 11 (1989) 1; 1025-1034, Deikman and 
Fischer, EMBO J. (1988) 7; 3 3 15-3 320 and Delia Penna et al . , 
Proc. Natl. Acad. Sci . USA (1986) 83:6420-6424.) 

The discovery of both new constitutive promoters and 
new tissue-specific promoters is desired for the controlled 
expression of various nucleic acid sequences that are 
genetically engineered into transgenic plants. There are many 
valuable potential applications of genetic engineering of 
plants. A variety of plant promoters with different 
characteristics and which are effective in diffeirent species 
of plants is desirable in order to bring these potential 
applications into practice. 

SUMMARY OF THE INVENTION 
The present invention provides for an isolated plant 
Gp2 promoter containing a nucleic acid sequence that is either 
identical to or which has substantial sequence identity to the 
nucleic acid sequence depicted in Seq. ID No. 4. 
Preferentially, the Gp2 promoter is from a member of the 
family Solanaceae. More preferably, the Gp2 promoter is from 
the genus Solanum of the family Solanaceae. The promoter can 
be, for example, from a pepper plant. The Gp2 promoter can 
be, for instance, either constitutive or tissue-specific, 
constitutive Gp2 promoters typically are from about 250 to 
about 1,250 nucleotides in length. A preferred constitutive 
Gp2 promoter is about 1,233 nucleotides in length. An example 
of such a constitutive promoter is shown as Seq. ID No. 4. 
Tissue-specific Gp2 promoters are typically from about 1,500 
to about 3,500 nucleotides in length. Tissue-specific 
promoters of the invention comprise a nucleic acid sequence 
that is identical to or which has substantial sequence 
identity to the Gp2 promoter sequence contained in plasmid 
PEP4-2GUS. A preferred a tissue-specific Gp2 promoter is 
about 2,700 nucleotides in length. An example of such a 
promoter is the Gp2 promoter sequence contained in plasmid 
PEP4-2GUS. 
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In addition to the above described promoter 
sequences, the present invention also provides for a vector 
having a plant Gp2 promoter operably linked to a heterologous 
nucleic acid sequence. The promoter in such a vector contains 
a nucleic acid sequence that is either identical to or has 
substantial sequence identity to the nucleic acid sequence 
shovm as Seq. ID No. 4. The vector preferentially contains a 
promoter from the family Solanacea^ and more preferably from 
the genus Solanum. For instance, the promoter can be from a 
pepper plant. The Gp2 promoter in the vector can be, for 
example, either constitutive or tissue-specific. A tissue- 
specific Gp2 promoter can have a nucleic acid sequence that is 
identical to or which has substantial sequence identity to the 
Gp2 promoter sequence contained in plasmid PEP4-2GUS. An 
15 example of a constitutive promoter is shown as Seq. ID No. 4. 

An example of a tissue-specific Gp2 promoter that can be used 
in the vector is the promoter sequence contained in plasmid 
PEP4-2GUS. 

The present invention also provides for expression 
cassettes that have a Gp2 promoter operably linked to a 
heterologous nucleic acid sequence. The Gp2 promoter in the 
expression cassette contains a nucleic acid sequence that is 
either identical to or which has substantial sequence identity 
to the nucleic acid sequence depicted in Seq, ID No. 4. The 
25 expression cassette preferentially contains a Gp2 promoter 

from the family Solanaceae and more preferably from the genus 
Solanum, For instance, the promoter can be from a pepper 
plant. The Gp2 promoter in the expression cassette can be, 
for example, either constitutive or tissue-specif ic* An 
30 example of such a constitutive promoter is shown as Seq. ID 

No, 4. A tissue-specific Gp2 promoter can have a nucleic acid 
sequence that is identical to or which has substantial 
sequence identity to the Gp2 promoter sequence contained in 
plasmid PEP4-2GUS. An example of a tissue-specific Gp2 
3 5 promoter that can be used in an expression cassette is the 
promoter sequence contained in plasmid PEP4-2GUS. 

The present invention also provides for an isolated 
plant Gp2 gene or gene fragment having a plant Gp2 promoter 
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operably linked to a nucleic acid sequence encoding a plant 
Gp2 protein. The plant Gp2 protein has an amino acid sequence 
that is either identical to or which has substantial sequence 
identity to the amino acid sequence depicted in Seq. ID No. 2. 
Preferentially, the plant Gp2 protein is from the family 
Solanacoae and more preferably from the genus Solanum. For 
instance, the Gp2 protein can be from a pepper plant. The 
pepper plant Gp2 amino acid sequence can be, for instance, the 
amino acid sequence of Seq. ID No. 2. The isolated plant Gp2 
gene or gene fragment can have, for example, the nucleic acid 
sequence depicted in Seq. ID No. 3. 

In addition to nucleic acids, vectors and expression 
cassettes, the present invention also provides for transgenic 
plants that have a plant Gp2 promoter operably linked to a 
heterologous nucleic acid sequence. This Gp2 promoter 
contains a nucleic acid sequence identical to or with 
substantial sequence identity to the nucleic acid sequence 
depicted in Seq. ID No. 4. The heterologous DNA sequence can 
be expressed, for example, constitutively or in a tissue 
specific manner in the transgenic plant. For constitutive 
expression, the Gp2 promoter can have, for instance, the 
nucleic acid sequence depicted in Seq. ID No. 4. For tissue- 
specific expression, including expression of the heterologous 
DNA sequence selectively in fruit tissue of the transgenic 
plant, the promoter can have a nucleic acid sequence that is 
identical to or which has substantial sequence identity to the 
nucleic acid sequence of the Gp2 promoter contained in plasmid 
PEP4-2GUS. For tissue specific expression, the Gp2 promoter 
can be, for example, the GP2 promoter contained in plasmid 
PEP4-2GUS. The transgenic plant is preferably a member of the 
family Solanaceae and more preferably a member of the genus 
Solanum. Transgenic plants can be, for instance, pepper 
plants, tomato plants, and tobacco plants. 

The present invention also provides for methods of 
expressing heterologous DNA sequences in plant cells. In 
these methods, a plant cell is transformed with a vector that 
has a Gp2 promoter operably linked to a heterologous nucleic 
acid sequence. After transformation with the vector, the 
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plant cell is grown under conditions where heterologous 
nucleic acid sequences are expressed. The Gp2 promoter used 
in these methods contains a nucleic acid sequence identical to 
or with substantial sequence identity to the nucleic acid 
5 sequence depicted in Seq. ID No. 4. This Gp2 promoter can be, 
for example, a constitutive promoter or a tissue-specific 
promoter. A constitutive Gp2 promoter can have, for example, 
the amino acid sequence depicted in Seq. ID No. 4. A tissue 
specific promoter can have a nucleic acid sequence that is 

10 identical to or which has substantial sequence identity to the 
Gp2 promoter sequence contained in plasmid PEP4-2GUS. A 
tissue specific promoter can be, for instance, the Gp2 
promoter contained in plasmid PEP4-2GUS. The plant cell of 
this method is preferably from a member of the family 

15 Sol^naceae and more preferably from the genus Solstnum. For 

instance, the plant cell can be a pepper plant cell, a tomato 
plant cell or a tobacco plant cell. 



BRIEF DESCRIPTION OF THE DRAWINGS 

2 0 Figure 1. RNA slot blot hybridized to ^^P Gp2 cDNA, 

Ten Mg of total RNA isolated from pepper tissue from green 
fruit, turning fruit, red fruit, leaf and root was applied to 
a nylon filter using a slot blot apparatus and hybridized to 
^^P labeled GP2 CDNA. Total sugar beet RNA and salmon sperm 

2 5 DNA were used a negative controls. 

Figure 2. Genomic Southern hybridization of pepper 
DNA with ^2p Gp2 cDNA. lO^tg of genomic pepper DNA was 
digested with restriction enzymes EcoRI (E) , Hindlll (HIII) , 
Ncol (N) , Spel (Sp), Sspl (S) and Xbal (X) and separated on an 

30 agarose gel. The DNA was transferred to a nylon membrane and 
hybridized to ^^p labelled Gp2 cDNA. 

Figure 3. RNAse protection analysis on total RNA 
prepared from pepper. Total RNA was isolated from the 
indicated plant tissues without pretreatment . Total RNA was 

35 also isolated from pretreated leaf tissue. For the pretreated 
leaf tissue, total RNA was isolated 24 hours after wounding, 
after 24 hours in 0.01% salicylic acid or after 24 hours in 
water. For all tissues, lO/xg of RNA was hybridized to the ^^P 
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labeled Gp2 riboprobe, the hybrids were digested with RNAse A 
and Tl and the protected fragments were separated on a 6% 
acrylamide urea gel. 

Figure 4: Restriction maps of the phage 4, phage 12 
and phage 20 Gp2 genomic clones obtained from Capsicum annum 
cv D2XYBA pepper plants. DNA prepared from the three genomic 
phage, #4, #12, and # 20, were digested with Ecol, Hindlll, 
Spel, and Ncol and restriction maps were generated. The 
following abbreviations are used: E=EcoI, H=HindIIl, Sp=SpeI, 
and N=NcoI. 

Figure 5: Restriction map of Gp2 clone PEP4-2. DNA 
from clone PEP4-2 was digested with Apal, Ncol, Sad, Bglll 
and PvuII and a restriction map was generated. The coding 
region is the boxed region indicated on the restriction map. 

Figure 6: Restriction map of Gp2 clone PEP4-6. The 
coding region is the boxed region indicated on the restriction 
map. 

Figure 7: Restriction maps of vectors WTT2132 and 
WTT2179. The following abbreviations are used: B=BamHI, Bs= 
BstEII, E=EcoRI, H=HindIIl, N=NcoI, and mcs-molecular cloning 
site. 

Figure 8: Restriction map of plasmid PEP4-6AccS. 

Figure 9: Restriction map diagrams of plasmids 
PEP4-6GUS, PEP4-2, and PEP4-2GUS. The region indicated in 
black is the Gp2 promoter; the region indicated in white is 
the Gp2 coding region; the hatched region is the GUS coding 
region; the striped region is the nos 3- region; and the ATG 
start codon is indicated with an arrow. 

Figure 10: Diagram showing the construction of 
plasmid pG2-ClAlO from pCP35M. The abbreviation "mcs" 
indicates the molecular cloning site in WTT2132. 

DEFINITIONS 

The term "nucleic acids", as used herein, refers to 
either DNA or RNA. "Nucleic acid sequence" or "polynucleotide 
sequence" refers to a single- or double-stranded polymer of 
deoxyribonucleotide or ribonucleotide bases read from the 5' 
to the 3' end. it includes self -replicating plasmids. 
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infectious polymers of DNA or RNA and nonfunctional DNA or 
RNA. 

In the polynucleotide notation used herein, unless 
specified otherwise, the left hand end of single-stranded 
5 polynucleotide sequences is the 5* end; the left hand 

direction of double-stranded polynucleotide seqpiences is 
referred to as the 5' direction. The direction of 5 • to 3 • 
addition of nascent RNA transcripts is referred to as the 
transcription direction; sequence regions on the DNA strand 

10 having the same sequence as the RNA and which are 5 • to the 5 ' 
end of the RNA transcript are referred to as "upstream 
sequences*'; sequence regions on the DNA strand having the same 
sequence as the RNA and which are 3 • to the 3 • end of the RNA 
transcript are referred to as "downstream sequences". 

15 The term "promoter" refers to a region of DNA 

upstream from the translational start codon and which is 
involved in recognition and binding of RNA polymerase and 
other proteins to initiate transcription, A "plant promoter" 
is a promoter capable of initiating transcription in plant 

20 cells. The terms "Gp2 plant promoter" or "Gp2 promoter" as 

used herein refer to plant promoters derived from the promoter 
region of a plant Gp2 gene. 

The terms "constitutive promoter or constitutive 
plant promoter" as used herein refer to a plant promoter which 

25 is capable of expressing operably linked DNA sequences in all 
tissues or nearly all tissues of a plant during normal 
development* The short form of the Gp2 promoter, as described 
herein, is an example of a constitutive promoter. The terms 
"inducible promoter" or inducible plant promoter", as used 

30 herein, refer to plant promoters that are capable of 

selectively expressing operably linked DNA sequences at 
P^^^icuiar times in response to endogenous or external 
stimuli . 

The term "tissue-specific promoter" as used herein 
35 refers to plant promoters that are capable of selectively 

expressing operably linked DNA sequences, in particular plant 
tissues. This means that the expression of the operatively 
linked DNA sequences is higher in one or several plant tissues 
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than it is in the other tissues of the plant. For example, 
the long form of the Gp2 promoter is a tissue-specific 
promoter that selectively expresses operably linked DNA 
sequences in fruit tissue during ripening. 
5 The term "operably linked" as used herein refers to 

linkage of a promoter upstream from a DNA sequence such that 
the promoter mediates transcription of the DNA sequence. It 
is understood that the promoter sequence also includes 
transcribed sequences between the transcriptional start and 
10 the translational start codon. 

The phrase "expression cassette", refers to 
nucleotide sequences which are capable of affecting expression 
of a structural gene in hosts compatible with such sequences. 
Such cassettes include at least promoters and optionally, 
15 transcription termination signals. Additional factors 

necessary or helpful in effecting expression may also be used 
as described herein. 

The term "vector", refers to viral expression 
systems and autonomous self-replicating circular DNA 
20 (plasmids) . Where a recombinant microorganism or cell culture 
is described as hosting an "expression vector," this includes 
extrachromosomal circular DNA or DNA that has been 
incorporated into the host chromosome ( s) , or both. Where a 
vector is being maintained by a host cell, the vector can 
either be stably replicated by the cells during mitosis as an 
autonomous structure, or incorporated within the host's 
genome . 

The term "plasmid" refers to an autonomous circular 
DNA molecule capable of replication in a cell, and includes 
both the expression and nonexpression types. Where a 
recombinant microorganism or cell culture is described as 
hosting an "expression plasmid", this includes both 
extrachromosomal circular DNA molecules and DNA that has been 
incorporated into the host chromosome (s) . Where a plasmid is 
being maintained by a host cell, the plasmid is either being 
stably replicated by the cells during mitosis as an autonomous 
structure or is incorporated within the host's genome. 



25 



30 



35 



wo 96/04781 



PCT/US95/10188 



10 



10 



15 



A "heterologous sequence" or a "heterologous DNA 
sequence", as used herein, is one that originates from a 
foreign source (or species) or, if from the same source, is 
modified from its original form. Thus, a heterologous DNA 
encoding sequence operably linked to a promoter is from a 
source different from that from which the promoter was 
derived, or, if from the same source, is modified from its 
original form. Modification of the heterologous DNA sequence 
may occur, e.g., by treating the DNA with a restriction enzyme 
to generate a DNA fragment that is capable of being operably 
linked to the promoter. Modification can occur by techniques 
such as site-directed mutagenesis. 

The phrase "selectively hybridizing to" refers to a 
nucleic acid probe that hybridizes, duplexes or binds only to 
a particular target DNA or RNA sequence when the target 
sequences are present in a preparation of total cellular DNA 
or RNA. "Complementary" or "target" nucleic acid sequences 
refer to those nucleic acid sequences which selectively 
hybridize to a nucleic acid probe. Proper annealing 
conditions depend, for example, upon a probe's length, base 
composition, and the number of mismatches and their position 
on the probe, and must often be determined empirically. For 
discussions of nucleic acid probe design and annealing 
conditions, see, for example, Sambrook et al . , Molecular- 
25 Cloning: A Laboratory Manual {2nd ed. ) , Vols. 1-3, Cold Spring 
Harbor Laboratory, (1989) or Current Protocols in Molecular 
Biology, F. Ausubel et al . , ed. Greene Publishing and Wiley- 
Interscience, New York (1987). 

The phrase "nucleic acid sequence encoding" refers 
to a nucleic acid which directs the expression of a specific 
protein or peptide. The nucleic acid sequences include both 
the DNA strand sequence that is transcribed into RNA and the 
RNA sequence that is translated into protein. The nucleic 
acid sequences include both the full length nucleic acid 
sequences as well as non-full length sequences derived from 
the full length protein. For example, the term "a nucleic 
acid encoding a plant Gp2 protein" includes those nucleic acid 
sequences which encode non-full length amino acid sequences 
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derived from the full-length Gp2 protein. It being further 
understood that the sequence includes the degenerate codons of 
the native sequence or sequences which may be introduced to 
provide codon preference in a specific host cell. 

The phrase "isolated" or "substantially pure" when 
referring to nucleic acid sequences encoding Gp2 proteins, 
refers to isolated nucleic acids that do not encode proteins 
or peptides other than Gp2 proteins. When referring to Gp2 
promoter sequences, the terms "isolated" or "substantially 
pure" refer to isolated nucleic acids that contain promoter 
sequences from a Gp2 gene, but which do not contain promoter 
sequences from other genes. 

The term "plant" includes whole plants, plant organs 
(e.g., leaves, stems, roots, etc.), seeds and plant cells and 
progeny of same. The class of plants which can be used in the 
method of the invention is generally as broad as the class of 
higher plants amenable to transformation techniques, including 
both monocotyledonous and dicotyledonous plants. it includes 
plants of a variety of ploidy levels, including polyploid, 
diploid and haploid. 

The term "transgenic plant" refers to a plant that 
has been produced by genetic engineering techniques. For 
example, plant cells transformed with vectors containing Gp2 
promoters operably linked to heterologous DNA sequences can be 
used to produce transgenic plants with altered phenotypic 
characteristics. 

The following terms are used to describe the 
sequence relationships between two or more nucleic acids or 
polynucleotides: "reference sequence", "comparison window", 
"sequence identity", "percentage of sequence identity", and 
"substantial identity". A "reference sequence" is a defined 
sequence used as a basis for a sequence comparison; a 
reference sequence may be a subset of a larger sequence, for 
example, as a segment of a full-length cDNA or gene sequence 
given in a sequence listing, such as the nucleic acid sequence 
or may comprise a complete cDNA or gene sequence. 

Optimal alignment of sequences for aligning a 
comparison window may be conducted by the local homology 
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algorithm of Smith and Waterman (1981) Adv. Appl . Math. 2:482, 
by the homology alignment algorithm of Needleman and Wunsch 
(1970) J. Hoi. Biol. 48:443, by the search for similarity 
method of Pearson and Lipman (1988) Proa. Natl. Acad. Sci . 
5 (USA) 85:2444, or by computerized implementations of these 

algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin 
Genetics Software Package Release 7.0, Genetics Computer 
Group, 575 Science Dr., Madison, WI) • 

The terms "substantial identity" or "substantial 

10 sequence identity" as applied to nucleic acid sequences and as 
used herein denote a characteristic of a polynucleotide 
sequence, wherein the polynucleotide comprises a sequence that 
has at least 85 percent sequence identity, preferably at least 
90 to 9 5 percent sequence identity, and more preferably at 

15 least 99 percent sequence identity as compared to a reference 
sequence over a comparison window of at least 20 nucleotide 
positions, frequently over a window of at least 25-50 
nucleotides, wherein the percentage of sequence identity is 
calculated by comparing the reference sequence to the 

20 polynucleotide sequence which may include deletions or 

additions which total 20 percent or less of the reference 
sequence over the window of comparison. The reference 
sequence may be a subset of a larger sequence, for example, as 
a segment of the Gp2 gene coding region or promoter regions, 

2 5 both disclosed herein. 

As applied to polypeptides, the terms "substantial 
. identity" or "substantial sequence identity" means that two 
peptide sequences, when optimally aligned, such as by the 
programs GAP or BESTFIT using default gap weights, share at 

3 0 least 80 percent sequence identity, preferably at least 90 

percent sequence identity, more preferably at least 95 percent 
sequence identity or more, 

"Percentage amino acid identity" or "percentage 
amino acid sequence identity" refers to a comparison of the 
35 amino acids of two polypeptides which, when optimally aligned, 
have approximately the designated percentage of the same amino 
acids. For example, "95% amino acid identity" refers to a 
comparison of the amino acids of two polypeptides which when 
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optimally aligned have 95% amino acid identity. Preferably, 
residue positions which are not identical differ by 
conservative amino acid substitutions. For example, the 
substitution of amino acids having similar chemical properties 
such as charge or polarity are not likely to affect the 
properties of a protein. Examples include glutamine for 
asparagine or glutamic acid for aspartic acid. 

DETAILED DESCRIPTION 
This invention provides for isolated plant Gp2 cDNA 
and genomic DNA constructs. This invention also provides for 
isolated plant Gp2 promoters and for DNA constructs containing 
a Gp2 promoter operably linked to heterologous DNA sequences. 
Gp2 promoters may be either Gp2 (short) promoters, which are 
constitutive, or Gp2(long) promoters which are tissue- 
specific. Both forms of Gp2 promoters are useful in the 
production of transgenic plants. Desired phenotypes are 
produced in transgenic plants as a result of transformation 
of plant cells by a DNA construct containing heterologous DNA 
sequence operably linked to a Gp2 promoter. 

Isolati on of Plant Gp2 promoters 

This invention relates to isolated cDNA and genomic 
DNA molecules encoding plant Gp2 proteins. The invention also 
relates to isolated plant Gp2 promoters and recombinant DNA 
constructs containing the Gp2 promoter. These DNA constructs 
include expression cassettes and a variety of vectors. 

1) cDNA and genomic DNA encoding plant Gp2 proteins 
Nucleic acid sequences encoding plant Gp2 proteins 
are typically identical to or show substantial sequence 
identity (determined as described above) to the nucleic acid 
sequence encoding the pepper plant Gp2 protein depicted in SEQ 
ID. No. 1. (Seq. ID No. 1 is a cDNA secpaence encoding a pepper 
plant Gp2 protein.) Nucleic acids encoding plant Gp2 proteins 
will typically hybridize to the nucleic acid sequence of Seq. 
ID No. 1 under stringent conditions. For example, high 
stringency hybridization can be done in buffer containing 50% 
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fonaamide, 10% dextran sulfate, lOX Denhardts, 100 ^g/ml 
salmon sperm, 1% SDS, 50mM NaP04 , and 0.6M NaCl at 42*>C. 
Filters are subsequently washed at 65«C in O.IX SSC, 0,1% SDS. 
Other stringent hybridization conditions may also be selected. 
5 Generally, stringent conditions are selected to be about 5* c 
lower than the thermal melting point (Tm) for the specific 
sequence at a defined ionic strength and pH. The Tm is the 
temperature (under defined ionic strength and pH) at which 50% 
of the target sequence hybridizes to a perfectly matched 
10 probe. Typically, stringent conditions will be those in which 
the salt concentration is at least about 0.02 molar at pH 7 
and the temperature is at least about 60<»C. As other factors 
may significantly affect the stringency of hybridization, 
including, among others, base composition and size of the 
15 complementary strands, the presence of organic solvents and 

the extent of base mismatching, the combination of parameters 
is more important than the absolute measure of any one. 

Plant Gp2 proteins appear to form a multigene (two 
or more) family in pepper plants. Plant Gp2 proteins 
20 typically show substantial sequence identity (as defined 

above) to the amino acid sequence of the pepper Gp2 protein as 
depicted in Seq, ID No. 2. 

Techniques for nucleic acid manipulation of genes 
encoding plant Gp2 proteins such as subcloning nucleic acid 
sequences encoding polypeptides into expression vectors, 
labelling probes, DNA hybridization, and the like are 
described generally in Sambrook, et aJ . , Molecular Cloning - A 
Laboratory Manual (2nd Ed.), Vol. 1-3, Cold Spring Harbor 
Laboratory, Cold Spring Harbor, New York, 1989, which is 
30 incorporated herein by reference. This manual is hereinafter 
referred to as "Sambrook, et ai . " 

There are various methods of isolating DNA sequences 
encoding plant Gp2 proteins. For example, DNA is isolated 
from a genomic or cDNA library using labelled oligonucleotide 
probes having sequences complementary to the sequences 
disclosed herein (see Seq. ID Nos. 1 and 3). Seq. ID No. 1 is 
a cDNA sequence which encodes a pepper Gp2 protein. (The 
predicted amino acid sequence for the pepper Gp2 protein is 



25 



35 



wo 96/04781 



PCT/US95/10I88 



15 



Shown in Seq. ID No, 2.) Seq. ID No. 3 is a genomic DNA 
sequence which includes a short form of a Gp2 promoter, the 
first exon and a part of the first intron. Full-length probes 
may be used, or oligonucleotide probes may also be generated 
by comparison of the sequences of Seq. ID Nos. l and 3. Such 
probes can be used directly in hybridization assays to isolate 
DNA encoding plant Gp2 proteins. Alternatively, probes can be 
designed for use in amplification techniques such as PGR, and 
DNA encoding plant Gp2 proteins may be isolated by using 
methods such as PGR (see below) . 

To prepare a cDNA library, mRNA is isolated from a 
plant tissue which expresses a Gp2 protein. For instance, the 
pericarp tissue of the fruit of a plant can be used. cDNA is 
prepared from the mRNA and then a second, complementary DNA 
strand is synthesized. Subsequently, this duplex DNA molecule 
is ligated into a recombinant vector. The vector is 
transfected into a recombinant host for propagation, screening 
and cloning. Methods for making and screening cDNA libraries 
are well known. See Gubler, U. and Hoffman, B.J. Gene 
25:263-269, 1983 and Sambrook, et ai . 

For a genomic library, typically the DNA is 
extracted from plant tissue and either mechanically sheared or 
enzymatically digested to yield fragments of about 15-20kb. 
The fragments are then separated by gradient centrif ugation 
from undesired sizes and are constructed in bacteriophage 
lambda vectors. These vectors and phage are packaged in 
vitro, as described in Sambrook, et al . Recombinant phage are 
analyzed by plaque hybridization as described in Benton and 
Davis, Science, 196:180-182 (1977). Colony hybridization is 
carried out as generally described in M. Grunstein et al . 
Proc. Natl. Acad, Sci . USA., 72:3961-3965 (1975). DNA 
encoding plant Gp2 proteins is identified in either cDNA or 
genomic libraries by its ability to hybridize with nucleic 
acid probes, for example on Southern blots, and these DNA 
regions are isolated by standard methods familiar to those of 
skill in the art. See Sambrook, et al . See also Example 4, 
herein, for a description of the isolation of a pepper Gp2 
gene. 
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The tera "plant Gp2 gene" or "Gp2 gene" as used 
herein refers to a plant genomic DNA molecule that is the 
entire Gp2 promoter region operably linked to the entire 
coding region (including exons and introns) for the Gp2 
5 protein and may include the adjacent 3» flanking region which 
encodes the 3' non-translated mRNA. The term "plant Gp2 gene 
fragment" or "Gp2 gene fragment" refers to a portion of the 
plant Gp2 gene which is less than the entire promoter and 
coding regions of the gene, A plant Gp2 gene fragment may be 

10 composed of a promoter region operably linked to a portion of 
the coding region of the gene. An example of a plant Gp2 gene 
fragment from pepper is shown in Seq. ID No. 3. 

Nucleic acid amplification techniques such as 
polymerase chain reaction (PGR) technology, can be used to 

15 amplify nucleic acid sequences encoding Gp2 proteins from 

mRNA, from cDNA, and from genomic libraries or cDNA libraries. 
In PGR techniques, oligonucleotide primers complementary to 
the two 3' borders of the DNA region to be amplified are 
synthesized. The polymerase chain reaction is then carried 

20 out using the two primers. See PCR Protocols: A Guide to 

Methods and Applications (Innis, M, Gelfand, D. , Sninsky, J, 
and White, T. , eds.)# Academic Press, San Diego (1990). 
Primers can be selected to amplify the entire regions encoding 
a full-length Gp2 protein or its promoter. PCR can also be 

25 used to amplify smaller DNA segments of these regions as 
desired. 

PCR and related amplification techniques can be used 
in a number of ways to isolate cDNA and DNA molecules encoding 
Gp2 proteins. For example, PCR can be used in a variety of 

30 protocols to isolate cDNAs encoding Gp2 proteins. In these 
protocols, appropriate primers and probes for amplifying DNA 
encoding plant Gp2 proteins are generated from analysis of the 
DNA sequences listed herein. 

Oligonucleotides for use as probes in the above- 

35 mentioned procedures can be chemically synthesized according 
to the solid phase phosphoramidite triester method first 
described by Beaucage, S.L, and Carruthers, M.H,, 1981, 
Tetrahedron Lett., 22 ( 20) : 1859-1862 using an automated 
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synthesizer, as described in Needham-VanDevanter , D.R, , t 
al*, 1984, Nucleic Acids Rgs . , 12:6159-6168. Purification of 
oligonucleotides is by either native acrylamide gel 
electrophoresis or by anion-exchange HPLC as described in 
Pearson, J.D. and Regnier, 1983, J-. Chrom., 255:137-149, 

The sequence of the synthetic oligonucleotide can be verified 
using the chemical degradation method of Maxam, A.M, and 
Gilbert, W. 1980, in Grossman, L. and Moldave, D. , eds* 
Academic Press, New York, MBthods in Enzymology, 65:499-560. 

2^ Plant Gr)2 promoters 

Plant Gp2 promoter sequences are typically identical 
to or show substantial sequence identity (determined as 
described above) to the pepper plant Gp2 promoter nucleic acid 
sequence depicted in SEQ ID- No. 4. Seq. ID No. 4 depicts the 
nucleotide sequence of a long form of the pepper Gp2 promoter. 
Plant Gp2 promoter sequences typically hybridize to the 
nucleic acid sequence of Seq. ID No. 4 under stringent 
conditions. For example, high stringency hybridization can be 
done in buffer containing 50% formamide, 10% dextran sulfate, 
lOX Denhardts, lOOMg/ml salmon sperm, 1% SDS, 50mM NaP04 , and 
0.6M NaCl at 42«C. Filters are subsequently washed at 65*C in 
O.IX SSC, 0.1% SDS. Other stringent hybridization conditions 
such as those described above for cDNA and genomic DNA 
encoding plant Gp2 proteins may also be selected. 

Plant Gp2 promoters can be isolated in a variety of 
ways. Since Gp2 protein is a member of a multigene family, a 
number of different Gp2 promoters having homology or 
substantial sequence identity to the Gp2 promoter sequence of 
Seq. ID. No. 4 may be isolated from pepper plants. In 
addition, Gp2 promoters may be isolated from a variety of 
other plant species, in particular, other SolanacB^e species. 

There are a variety of methods known to those of 
skill in the art which may be used for isolation of plant Gp2 
promoters. For example, plant Gp2 promoters can be isolated 
from genomic DNA fragments encoding a plant Gp2 protein and 
which also contain sequences upstream from the sequence 
encoding the Gp2 protein. Genomic fragments encoding plant 
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Gp2 proteins can be isolated as described above. See Example 
4, herein, for a demonstration of the isolation of Gp2 
promoter sequences from genomic DNA fragments which encode 
pepper Gp2 proteins. 

Plant Gp2 promoter sequences can also be isolated by 
screening plant DNA libraries with oligonucleotide probes 
having sequences derived from the DNA sequence of the pepper 
Gp2 promoter depicted in Seq. ID No. 4. The various 
methodologies described above for isolation of genomic DNA 
fragments encoding a Gp2 protein can also be used for the 
isolation of Gp2 promoters using the Gp2 promoter sequence of 
Seq. ID No. 4. other methods known to those of skill in the 
art can also be used to isolate plant DNA fragments containing 
Gp2 promoters. See Sambrook, et al . for a description of 
15 other techniques for the isolation of DNAs related to DNA 
molecules of known sequence. 

Different forms of a Gp2 promoter can be produced 
that have different properties. For example, short forms of 
Gp2 promoters control constitutive expression of DNA sequences 
that are operably linked to the promoter. Short forms of a 
Gp2 promoter which regulate constitutive expression are 
typically from 250 to 1250 nucleotides in length. An example 
of a short form of a Gp2 promoter 1233 nucleotides in length 
is shown in Seq. ID No, 4. 

Short forms of a Gp2 promoter can be constructed in 
a variety of ways known to those of skill in the art. For 
example, short forms of Gp2 promoters can be constructed by 
mapping restriction enzyme sites in the promoter and then 
using the constructed map to determine appropriate restriction 
enzyme cleavage to excise a subset of the sequence. The 
shorter restriction fragment can then be inserted into a 
suitable vector. The construction of a specific short form of 
a Gp2 promoter which controls constitutive expression of 
operably linked heterologous DNA sequences is shown in Example 
35 4, herein. Other short forms of the Gp2 promoter that are 
constitutive can also be prepared in a similar fashion. 

Short forms of a Gp2 promoter can be shown to 
express operably linked heterologous DNA sequences in a 
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constitutive fashion. This can be done by first preparing a 
vector that has a short form of the Gp2 promoter operably 
linked to an indicator gene. Plant cells are then transformed 
with the vector and transgenic plants are produced from the 
transformed plant cells. Expression of the indicator gene 
under the control of the Gp2 promoter is then determined. See 
Examples 6-12 herein for a demonstration of the constitutive 
expression of a heterologous DNA sequence by the short form of 
a Gp2 promoter. 

Long forms of Gp2 promoters that are tissue-specific 
promoters may also be isolated, using techniques similar to 
those used for isolation of short forms of the Gp2 promoter. 
Long forms of a Gp2' promoter typically contain from 1,500 to 
3,500 nucleotides. An example of a long form of a Gp2 
promoter, containing about 2,700 nucleotides, and which is 
obtained from a pepper plant, is described in Example 16 
herein. 

Long forms of Gp2 promoters can also be constructed 
in a variety of ways. For example, construction of long forms 
of Gp2 promoters that regulate tissue-specific expression of 
operably linked heterologous sequences can be performed by 
using restriction enzymes, as described above for the 
construction of short forms Gp2 promoters. Long forms of Gp2 
promoters can also be constructed by adding additional 
sequences to a short Gp2 promoter, using procedures similar to 
those described in Example 13, herein. 

Long forms of the Gp2 promoter can be shown to 
selectively express operably linked heterologous DNA sequences 
in fruit tissue of transgenic plants. This can be done as 
described above for the short forms of the Gp2 promoter. See 
Examples 14-18, herein, for a demonstration of the operation 
of a long form of a Gp2 promoter in selectively expressing 
operably linked heterologous DNA sequences in transgenic 
plants. 
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&JL Congtruction of vectors containing a Gp2 promotier 

operablv linked to a heterologous PNA secruence 

Once a plant Gp2 promoter region has been isolated, 
various methods may be used to construct long and short forms 
5 of the promoter. These different forms of the Gp2 promoter 
can then be used in expression cassettes, vectors and other 
DNA constructs. A variety of techniques can be used for these 
manipulations of nucleic acids. These techniques are known to 
those of skill in the art and are described generally in 

10 Sambrook, et al . , supra. 

Expression cassettes containing a Gp2 promoter can 
be constructed in a variety of ways. For instance, various 
procedures, such as site directed mutagenesis can be used to 
introduce a restriction site at the start codon of a Gp2 gene 

15 fragment (see Example 5, herein). Then heterologous DNA 
sequences can be linked to the Gp2 promoter such that the 
expression of the heterologous sequences is regulated by the 
promoter. (See Examples 6, 9, and 11 herein for examples of 
procedures useful for operably linking heterologous DNA 

20 sequences to Gp2 promoters.) DNA constructs composed of a Gp2 
promoter operably linked to heterologous DNA sequences can 
then be inserted into a variety of vectors. Such vectors 
include expression vectors that are useful in the 
transformation of plant cells. (See Examples 6, 9, 11, 13 and 

25 16 herein, for a description of the production of several 
vectors containing a Gp2 promoter operably linked to a 
heterologous DNA sequence.) Many other such vectors useful in 
the transformation of plant cells can be constructed by the 
use of recombinant DNA techniques well known to those of skill 

30 in the art. 

C. Prod uction of transgenic plants 

1) Use of DNA co nstructs containing Gp2 promoters 
to produce al tered phenotvpes in transgenic plants 
DNA constructs containing a Gp2 promoter operably 
linked to a heterologous DNA sequence can be used to transform 
plant cells and produce transgenic plants with desired 
phenotypic characteristics. There are a variety of different 
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approaches one can use to produce a desired phenotype in 
transgenic plants. For example, by using methods described 
herein, one can operably link a novel gene to a Gp2 promoter 
and transform plant cells. Transgenic plants can be produced 
from the transformed plant cells so that the novel gene 
product is produced in all tissues or in only certain tissues 
of a transgenic plant. In this context, the term "novel gene" 
refers to a gene that is not normally present in a plant or 
which, if present, is not normally expressed in a particular 
plant cell tissue. The expression of the novel gene can 
result in the production of a protein that confers an altered 
phenotype for a transgenic plant. 

A mutant form of the ALS gene is of interest because 
it can be used in a wide range of plant species as a 
selectable marker in plant transformation protocols. 
Expression of the mutant ALS gene confers resistance to the 
herbicide chlorsulf uron (see Example 9, herein). The 
constitutive form of a Gp2 promoter has been fused to a mutant 
form of the ALS gene. Transformation of tomato and pepper 
plant cells with this DNA construct has been shown to result 
in expression of the ALS gene in the transformed plant cells 
(see Examples 9 and 10, herein). Thus, the mutant ALS gene, 
when introduced into plant cells and constitutively expressed, 
confers a phenotype that is useful in the production of 
transgenic plants. 

A variety of other genes capable of altering a plant 
phenotype can also be expressed with a Gp2 promoter. For 
constitutive expression, the short form of the Gp2 promoter 
can be used operably linked to, e.g., genes for herbicide 
resistance; genes for fungal disease resistance (e.g., 
chitinases and glucanases) ; genes for bacterial disease 
resistance (e.g., cecropins) ; and genes for insect resistance 
(e.g., B. thuringiGnsis toxin). For tissue-specific 
expression, the long form of the Gp2 promoter can be used 
operably linked to, e.g., genes for ripening or degradation 
(e.g., Acc oxidase, Acc synthase, polygalacturonase, phytoene 
synthase); genes for color; or genes for sweetness. 
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One of skill will recognize that proteins have 
different domains which perform different functions. Thus, 
gene sequences operably linked to a Gp2 promoter need not be 
full length, so long as the desired functional domain of the 
protein is expressed. Modified protein chains can also be 
readily designed utilizing various recombinant DNA techniques 
well known to those skilled in the art. For example, the 
chains can vary from the naturally occurring sequence at the 
primary structure level by amino acid substitutions, 
additions, deletions, and the like. These modifications can 
be used in a number of combinations to produce the final 
modified protein chain. 

DNA constructs containing a Gp2 promoter operably 
linked to a heterologous DNA sequence can also be used in a 
number of techniques to suppress expression of endogenous 
plant genes, e.g., sense or antisense suppression. In 
antisense technology, a nucleic acid segment from the desired 
plant gene is cloned and operably linked to a Gp2 promoter 
such that the anti-sense strand of RNA will be transcribed. 
The construct is then transformed into plants and the anti- 
sense strand of RNA is produced. In plant cells, it has been 
shown that anti-sense RNA inhibits gene expression; see, e.g., 
Sheehy et al., Proc. Wat. Acad. Sci . USA, 85:8805-8809 (1988), 
and Hiatt et ai., U.S. Patent No. 4,801,340 which are 
25 incorporated herein by reference. 

The nucleic acid segment to be introduced in 
antisense suppression generally will be substantially 
identical to at least a portion of the endogenous gene or 
genes to be repressed, but need not be identical. The vectors 
30 of the present invention can be designed such that the 

inhibitory effect applies to other proteins within a family of 
genes exhibiting homology or substantial homology to the 
target gene. Segments from a gene can be used (l) directly to 
inhibit expression of homologous genes in different plant 
35 species, or (2) as a means to obtain the corresponding 
sequences, which can be used to suppress the gene. 

The introduced sequence also need not be full length 
relative to either the primary transcription product or fully 
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proc ssed mRNA. Generally, higher homology can be used to 
compensate for the use of a shorter sequence ♦ Furthermore, 
the introduced sequence need not have the same intron or exon 
pattern, and homology of non-coding segments will be equally 
effective. Normally, a sequence of between about 30 or 40 
nucleotides and about 2,000 nucleotides should be used, though 
a sequence of at least about 100 nucleotides is preferred, a 
sequence of at least about 200 nucleotides is more preferred, 
and a sequence of at least about 500 nucleotides is especially 
pref erred • 

Catalytic RNA molecules or ribozymes also have been 
reported to have use as a means to inhibit expression of 
endogenous plant genes. it is possible to design ribozymes 
that specifically pair with virtually any target RNA and 
cleave the phosphodiester backbone at a specific location, 
thereby functionally inactivating the target RNA. In carrying 
out this cleavage, the ribozyme is not itself altered, and is 
thus capable of recycling and cleaving other molecules, making 
it a true enzyme. The inclusion of ribozyme sequences within 
antisense RNAs confers RNA-cleaving activity upon them, 
thereby increasing the activity of the constructs • 

A number of classes of ribozymes have been 
identified. one class of ribozyme is derived from a number of 
small circular RNAs which are capable of self-cleavage and 
replication in plants. The RNAs replicate either alone 
(viroid RNAs) or with a helper virus (satellite RNAs) . 
Examples include RNAs from avocado sunblotch viroid and the 
satellite RNAs from tobacco ringspot virus, lucerne transient 
streak virus, velvet tobacco mottle virus, solanum nodiflorum 
mottle virus and subterranean clover mottle virus. The design 
and use of target RNA-specif ic ribozymes is described in 
Haseloff et al. Mature, 334:585-591 (1988), which is 
incorporated herein by reference. 

A preferred method of suppression is sense 
suppression. Introduction of a nucleic acid configured in the 
sense orientation has been shown to be an effective means by 
which to block the transcription of target genes. For 
examples of the use of this method to modulate expression of 
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endogenous genes see, Napoli et al . , The Plajit Cell 2:279-289 
(1990), and U.S. Patent No. 5,034,323, which are incorporated 
herein by reference. Sense suppression is a preferred method 
for ripening control (e.g., Acc oxidase or Acc synthase), 
5 sweetness control (e.g., ADPG pyrophosphorylase) , or color 
modification (e.g., chalcone synthase); see U.S. Patent 
5,034, 323. 

Generally, in sense suppression, some transcription 
of the introduced sequence occurs. The effect may also occur 
10 where the introduced sequence contains no coding sequence per 
se, but only intron or untranslated sequences homologous to 
sequences present in the primary transcript of the endogenous 
sequence. The introduced sequence generally will be 
substantially identical to the endogenous sequence intended to 
15 be repressed. This minimal identity will typically be greater 
than about 65%, but a higher identity is useful to exert a 
more effective repression of expression of the endogenous 
sequences. Substantially greater identity of more than about 
80% is preferred, though about 95% to absolute identity would 
2 0 be most preferred. The effect may be applied to other 

proteins within a similar family of genes exhibiting homology 
or substantial homology. Segments from a gene can be used (i) 
directly to inhibit expression of homologous genes in 
different plant species, or (2) as a means to obtain the 
corresponding sequences, which can be used to suppress the 
. gene . 

In sense suppression, the introduced sequence, 
needing less than absolute identity, also need not be full 
length, relative to either the primary transcription product 
or fully processed mRNA. A higher identity in a shorter than 
full length sequence compensates for a longer, less identical 
sequence. Furthermore, the introduced sequence need not have 
the same intron or exon pattern, and identity of non-coding 
segments may be equally effective. A sequence of a size of at 
35 least 50 base pairs is preferred, with greater length 

sequences being more preferred; see U.S. Patent 5,03 4,32 3. 
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2> Trans formation of plant cells and production of 
transgenic plants 

Techniques for transforming a vide variety of higher 
plant species are well knovn and described in the technical 
and scientific literature- See, for example, Weising et ai • 
Ann. Rev. Genet. 22:421-477 (1988), which is incorporated 
herein by reference. As described herein, a constitutive or 
inducible Gp2 promoter is operably linked to the desired 
heterologous DNA sequence in a suitable vector. The vector 
comprising a Gp2 promoter fused to heterologous DNA will 
typically contain a marker gene which confers a selectable 
phenotype on plant cells. For example, the marker may encode 
biocide resistance, particularly antibiotic resistance, such 
as resistance to kanamycin, G418, bleomycin, hygromycin, or 
herbicide resistance, such as resistance to chlorsulf uron or 
Basta. Such selective marker genes are useful in protocols 
for the production of transgenic plants. 

DNA constructs containing a Gp2 promoter linked to 
heterologous DNA can be introduced into the genome of the 
desired plant host by a variety of conventional techniques. 
For example, the DNA construct may be introduced directly into 
the genomic DNA of the plant cell using techniques such as 
electroporation and microinjection of plant cell protoplasts. 
Alternatively, the DNA constructs can be introduced directly 
to plant tissue using ballistic methods, such as DNA particle 
bombardment. In addition, the DNA constructs may be combined 
with suitable T-DNA flanking regions and introduced into a 
conventional AgrroJbacterium tumefaciens host vector. The 
virulence functions of the Agrobacterium tumefaciens host will 
direct the insertion of the construct and adjacent marker into 
the plant cell DNA when the cell is infected by the bacteria. 

Microinjection techniques are known in the art and 
well described in the scientific and patent literature. The 
introduction of DNA constructs using polyethylene glycol 
precipitation is described in Paszkowski et ai . , Embo J. 
3:2717-2722 (1984), Electroporation techniques are described 
in Fromm et ai . , Proc , Natl. Acad. Sex. USA 82:5824 (1985). 
Ballistic transformation techniques are described in Klein et 
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al., Nature 327:70-73 (1987). The full disclosures of each of 
these references are incorporated herein by reference. 

Another method is high velocity ballistic 
penetration by small particles with the nucleic acid either 
within the matrix of small beads or particles, or on the 
surface (Klein et al . , (1987) Nature 327:70-73. Although 
typically only a single introduction of a new nucleic acid 
segment is required, this method particularly provides for 
multiple introductions. 

Agrobacterium tumefaciejis-meditated transformation 
techniques are well described in the scientific literature, 
see, for example Horsch et al . Science (1984) 233:496-498, and 
Fraley et ai . (1983) Proa. Natl. Acad. Sci . USA 80:4803, both 
Of Which are incorporated herein by reference. see Examples 
7, 9, 11, 14, and 17, herein, for a demonstration of the 
transformation of plant cells with a vector comprising a Gp2 
promoter by Agrobacteri ujn tumefaciens. 

More specifically, a plant cell, an explant, a 
miristem or a seed is infected with Agrobacterium tumetaciens 
transformed with the segment. Under appropriate conditions 
known in the art, the transformed plant cells are grown to 
form shoots, roots, and develop further into plants. The 
nucleic acid segments can be introduced into appropriate plant 
cells, for example, by means of the Ti plasmid of 
Agrobacteriu;n tuinefaciens. The Ti plasmid is transmitted to 
plant cells upon infection by Agrobacterium tumefaciens, and 
is stably integrated into the plant genome (Horsch et al . 
(1984) "Inheritance of Functional Foreign Genes in Plants'" 
science, 233:496-498; Fraley et a2 . , (1983) Proc. Nat'l. Acad. 
Sci. U.S.A. 80:4803. 

Ti plasmids contain two regions essential for the 
production of transformed cells. One of these, named transfer 
DNA. (T DNA), induces tumor formation. The other, termed 
virulent region, is essential for the introduction of the T 
DNA into plants. The transfer DNA region, which transfers to 
the plant genome, can be increased in size by the insertion of 
the foreign nucleic acid sequence without its transferring 
ability being affected. By removing the tumor-causing genes 
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SO that they no longer interfere, the modified Ti plasmid can 
then be used as a vector for the transfer of the gene 
constructs of the invention into an appropriate plant cell, 
such being a "disabled Ti vector". 

All plant cells which can be transformed by 
Agrobacterium and whole plants regenerated from the 
transformed cells can also be transformed according to the 
invention so as to produce transformed whole plants which 
contain the transferred foreign nucleic acid sequence • 

There are various ways to transform plant cells with 
Agrohacterium , including : 

(1) co-cultivation of Agrobacterium with cultured 
isolated protoplasts, 

(2) transformation of cells or tissues with 
AgrobactBrium , or 

(3) transformation of seeds, apices or meristems 
with AgrobaatGrium . 

Method (1) requires an established culture system 
that allows culturing protoplasts and plant regeneration from 
cultured protoplasts. 

Method (2) requires (a) that the plant cells or 
tissues can be transformed by AgrobactBrium and (b) that the 
transformed cells or tissues can be induced to regenerate into 
whole plants. 

Method (3) requires micropropagation . 

In the binary system, to have infection, two 
plasmids are needed: a T-DNA containing plasmid and a vir 
plasmid. Any one of a number of T-DNA containing plasmids can 
be used, the only requirement is that one be able to select 
independently for each of the two plasmids. 

After transformation of the plant cell or plant, 
those plant cells or plants transformed by the Ti plasmid so 
that the desired DNA segment is integrated can be selected by 
an appropriate phenotypic marker. These phenotypic markers 
include, but are not limited to, antibiotic resistance, 
herbicide resistance or visual observation. Other phenotypic 
markers are known in the art and may be used in this 
invention. 
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The present invention embraces use of the claimed 
promoters in transformation of any plant, including both 
dicots and monocots. Transformation of dicots is described in 
references above. Transformation of monocots is known using 
5 various techniques including electroporation (e.g., Shimamoto 
et al.. Nature (1992), 338:274-276; ballistics (e.g., European 
Patent Application 270,356); and Agrobacterium (e.g., Bytebier 
et ai., Proc. Nat'l Acad. Sci. USA (1987) 84:5345-5349). 

Transformed plant cells which are derived by any of 
10 the above transformation techniques can be cultured to 
regenerate a whole plant which possesses the desired 
transformed phenotype. Such regeneration techniques rely on 
manipulation of certain phytohormones in a tissue culture 
growth medium, typically relying on a biocide and/or herbicide 
15 marker which has been introduced together with the Ph 

nucleotide sequences. Plant regeneration from cultured 
protoplasts is described in Evans et al . , Protoplasts 
Isolation and Culture, Handbook of Plant Cell Culture, pp. 
124-176, MacMillan Publishing Company, New York, 1983; and 
20 Binding, Regeneration of Plants, Plant Protoplasts, pp. 21-73, 
CRC Press, Boca Raton, 1985, all of which are incorporated 
herein by reference. Regeneration can also be obtained from 
plant callus, explants, organs, or parts thereof. Such 
regeneration techniques are described generally in Klee et al . 
25 Ann. Rev. of Plant Phys . 38:467-486 (1987), which is 
incorporated herein by reference. 

One of skill will recognize that, after an 
expression cassette comprising the Gp2 promoter is stably 
incorporated in transgenic plants and confirmed to be 
operable, it can be introduced into other plants by sexual 
crossing. Any of a number of standard breeding techniques can 
be used, depending upon the species to be crossed. 

The expression of the heterologous DNA sequences 
linked to a Gp2 promoter can be detected in a variety of ways, 
depending on the nature of heterologous sequences. For 
example, one may assay for the desired phenotype. The desired 
phenotype which results from the successful expression of 
heterologous DNA sequences under control of a Gp2 promoter may 
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be determined by a variety of ways, depending on the 
phenotypic trait that is introduced. For instance, resistance 
to a herbicide can be detected by treatment with the 
herbicide . 

Expression of the heterologous DNA can also be 
detected by measurement of the specific RNA transcription 
product. This can be done by, for example, RNAse protection 
or Northern blot procedures (see Examples 8, lo, 12, 15, and 
18 herein) , If heterologous DNA sequences encode a novel 
protein, the protein product may be assayed, for instance, by 
its function or by a variety of immunoassay techniques. For 
example, a novel protein product with enzymatic activity can 
be measured in an enzyme assay (see examples 8 and 15, 
herein) . 

The methods and compositions of the invention have 
use over a broad range of types of plants, including species 
from the genera Fragarla, Lotus, Medicago, Onobrychls, 
Trlfolxum, Trigonella, Vigna, Citrus, Linum, Geranium, 
Manihot, Daucus, Arabldopis, Brassiaa, Raphanus, slnapis, 
Atropa, Capsicum, Hyoscyamus, Lycopersicon, Nicotiana, 
Solanum, PBtunia, Digitalis, Majorana, Ciohorium, Helianthus, 
Lactuca, Bromus, Asparagus, Antirrhinum, Herecocallis, 
Nemesia, Pelargonium, Panlcum, Pennlsetum, Ranunculus, 
seneclo, Salplglossis, Cucumls, Browaalla, Glycine, Lollum, 
Zea, Trlticum, Sorghum, Datura, Chrysanthemum, Dianthus, 
Gerhera, Euphorbia, Ipomoea, Passiflora, Cyclamen, Malus, 
Prunus, Rosa, Rubus, Populus, Santalum, Allium, Lilium, 
Narcissus, Ananas, Arachis, Phaseolus and Pisum, and more 
particularly including oil crops such as canola (Brassica 
sp.), cotton (Gossypium sp.), peanut (Arachis sp.), sunflower 
(Helianthus sp.), palm (fTIaeis sp.), flax (Linum sp.), 
saf flower (Carthamus sp. ) , coconut (Cocos ap.) and soybean 
(Glycine sp.); grain crops such as wheat (Trlticum sp.), corn 
(Zea sp.), sorghum (Sorghum sp.), barley (Hordeum sp.), rye 
(Secale sp.), ©ats (Avena sp.) and rice (Oryza sp.); fruit 
crops such as banana (Musa sp.), citrus (Citrus sp.), berries 
(e.g., strawberry (Fragaria Sp.) or raspberry (Rubus sp.), 
mango (Mangifera sp. ) , melon (Cucumis sp.), pear (Pyrus sp.). 
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cucumber (Cucuais sp.) , and apricot, peach, cherry, plum and 
prune (Prunus sp.); vegetable crops such as pea (Pisum sp.), 
bean (Vicia sp.), broccoli and related crucifers (Brassica 
sp.), spinach (spinacia sp.), onion (Allium sp.), celery 
(Apium sp.), carrot {Daucus sp.), asparagus (Aspara^s sp.), 
and artichoke (Helianthus sp.); tomato (Lycopersicon 
esculentum) , pepper (Capiscum annuum) ; additional ornamental 
crops such as tulip (Tulipa sp.), snapdragon (Antirrhinum 
sp.). Iris (Iris sp.). Orchids (Cymbidium and Cattleya sp.), 
pelargonium; beverage crops such as coffee (Coffea sp.) and 
tea (Thea sp.); herb crops such as mint (Mentha sp.), thyme 
(Thymus sp.) and marjoram (Origanum sp.). 

The following examples are provided by way of 
illustration and not limitation. 



EXAMPLES 

Example 1; Isolation and cha racterization of Gd2 cDNA 

clones from pepper plants 

Gp2 CDNA was obtained by the differential screening 
of a green pepper pericarp cDNA library with green fruit cDNA. 
The selected cDNA clone was subsequently shown to hybridize 
very strongly to pepper pericarp mRNA (from both green and red 
fruit) but to be absent in mRNA from leaf, root, flowers etc. 

In order to isolate cDNA, pepper plants (Capsicum 
annum cv D2XYBA) were grown under greenhouse conditions. 
Poly (A)* RNA was isolated from mature green fruit, red fruit 
and leaf (Dunsmuir et al . , eds., (1988) Plant Molecular 
Biology Manual, S. Gelvin and R. Schilperoovt , Kluwer Academic 
Dordrecht, pp. i-i?) . First strand cDNA was synthesized from 
the poly (A)+ RNA (extracted from both mature green fruit and 
red fruit) using oligo dT as the primer and reverse 
transcriptase. The second strand was synthesized using DNA 
polymerase I and RNAse H according to the manufacturers 
instructions (Amershara Little Chaflont, U.K.). The cDNA was 
modified with EcoRI methylase to inhibit digestion of internal 
EcoRi sites then ligated to EcoRI linkers. After cutting back 
with EcoRI to remove excess linkers, size selection was done 
using an acrylamide gel or Sephacryl S4 00 column and only 
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those CDNAs greater than 500 bp were used. The double 
stranded cDNA was cloned into the EcoRl site of the lambda - 
ZAP vector (Stratagene, San Diego, California, USA) and 
packaged in vitro using Gigapack packaging extracts 
(Stratagene) . 

10,000 recombinant phage from the mature green fruit 
CDNA library (2000/85mm plate) were plated using E. coli 
strain C600. The phage plagues were transferred onto 
nitrocellulose filters (filters were made in triplicate) and 
the phage DNA was denatured in 1.5 M NaCl, 0.5 N NaOH and 
neutralized in 1.5 M NaCl, 0.5 M Tris-HCl, pH8 . The filters 
were screened with radiolabeled cDNA made from red fruit 
mRNA, mature green fruit mRNA and leaf mRNA. Hybridization 
conditions and washes were essentially as described in 
Sambrook, et aJ . , (1989) Molecular Cloning: a Laboratory 
Manual, Second Edition, Cold Spring Harbor Laboratory, New 
York. 

Twelve plaques were selected which hybridized 
strongly with mature green fruit and red fruit cDNAs and not 
With leaf CDNA. DNA was purified from these selected phage for 
further analysis. The cDNA inserts were isolated from the 
recombinant phage by gel purification after restriction 
digestion with EcoRI . The inserts were subcloned into the 
EcoRI site of the plasmid pGEM7 (Promega Corporation, Madison, 
Wisconsin, USA) . 

The plasmid clones were first used in cross- 
hybridization experiments to determine if they were related to 
each other. 0.5-1 ^g of EcoRI digested plasmid DNA from each 
CDNA clone was run in parallel on an agarose gel, transferred 
to a Duralon nylon membrane (Stratagene, San Diego, 
California, USA) and hybridized to nick translated 32p labeled 
probes prepared from each of the cDNA clones. Plasmid 
cloning. Southern transfer, probe preparation and 
hybridizations were done as described in Sambrook et al . , 
supra (1989). Based upon the cross hybridization pattern, the 
clones were placed into four separate homology groups numbered 
Groups 1 to 4. The longest cDNA from each homology group was 
then used as a probe in Northern hybridizations to total RNA 
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isolated from leaf, root, green fruit and red fruit, and as a 
probe for slot blots prepared from total RNA isolated from 
pepper fruit at different stages of development (days after 
anthesis) . For Northern blots, 10 of total RNA was 
5 separated on a 1.5% agarose/ formaldehyde gel and transferred 
to a nylon membrane, lOug of total RNA was treated in 6% 
formaldehyde/ 50% formamide at 50 C for 60 minutes before 
loading onto the nylon filter. The Northern blots and slot 
blots were prepared and hybridized to ^^P nick-translated cDNA 
10 probes as described in Sambrook et al . , (1989), supra, cDNAs 
in group l (Gpl) and group 3 (Gp3) hybridized to fruit and 
root RNA while those in group 2 (Gp2) and group 4 (Gp4) gave 
fruit specific hybridization. The Gp2 clones hybridized to a 
700-800 base transcript which was expressed primarily in 
15 mature fruit (see Figure 1), The Gp4 clones hybridized to a 
900-1000 base transcript which was expressed at moderate 
levels throughout fruit development. In view of these 
expression profiles, all subsequent analysis was confined to 
the Gp2 CDNA clones. The mature green fruit cDNA library and 
the red fruit cDNA library was rescreened with the pGP23 cDNA 
in order to isolate and characterize related cDNAs for the 
purpose of establishing whether multiple genes were being 
transcribed in the mature fruit tissue or whether all 
transcripts derived from a single gene. The hybridization 
25 patterns indicated that the Gp2 transcripts were derived from 
genes which were part of a small multigene family in the 
pepper genome (See Figure 2), 

Once the cDNA clone was demonstrated to be expressed 
at a high level in the fruit, the nucleotide sequence of the 
3 0 CDNA was determined. Sixteen independent Gp2 cDNAs were 

isolated from the lambda ZAP libraries. The cDNA inserts were 
excised from the lambda clones as EcoRl fragments and these 
were recloned into the vector BS(+) (Stratagene, San Diego, 
California, USA) which is suitable for preparation of single 
strand templates for gene expression and sequencing studies. 
These clones were sequenced from both the 5' and 3» end by the 
Sanger dideoxy chain termination method. All 16 clones were 
found to be identical in primary sequence except for the 
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position of the polyadenylation sites, and the number of A 
residues in A-tracks at the extreme 5' end of the clones. A 
common feature of plant genes is that transcripts from a 
single gene will have variability in the polyadenylation site; 
5 see Dean et al , , (1986), Nucl . Acid. Res. 14:2229-2240. 

Transcripts from a single gene may also have extreme 5* end 
differences in the number of A residues in A-tracks; this is a 
common artifact of cDNA synthesis. These data suggest that 
the Gp2 transcripts arise from a single gene. When the 16 

10 clones were aligned, pGP13 had the longest 3' and pGP50 had 

the longest 5« sequence; pGP13 was completely sequenced; pGPSO 
was sequenced across the 5 ' end which overlapped with pGP13 . 
The composite cDNA sequence is shown in Seq. ID No. 1. 

The pGPl3 sequence was translated in three frames 

15 and the longest open reading frame (beginning with the first 

ATG) is designated as such in Seq. ID No. 1. The cloned cDNA 
fragment encodes a 75 amino acid polypeptide which is highly 
enriched in the sulphur containing amino acids. This 
predicted amino acid sequence is shown as Seq. ID No. 2. A 

20 comparison of the predicted protein sequence to sequences in 
EMBL Genbank shows some homology to proteins similar to the 
Bowman Birk proteinase inhibitors, as well as thionins, and a- 
amylase inhibitors . 



Example 2; Genomic DNA encoding Gp2 proteins in pepper 

Plants 

Genomic DNA encoding Gp2 proteins was identified in 
pepper plants by Southern blot analysis. Genomic Southern 
analysis was performed as described by Sambrook et al., supra 
1989. Genomic pepper DNA was isolated from cultivar D2XYBA 
according to the method of Dooner et al., (198 5) Mol . Gen. 
Genet. 200:240-246 and digested with EcoRI, Hindlll, and Xbal. 
The digested DNA was separated on an agarose gel and 
transferred to Duralon membrane (Stratagene, San Diego, 
California, USA) . The membrane was hybridized to a ^^P in 
vitro labeled RNA riboprobe prepared from pGP2 3 (Gp2 cDNA 
clone) . High stringency hybridization was done in buffer 
containing 50% formamide, 10% dextran sulfate, lOX Denhardts, 
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lOO/ig/ml salmon sperm, 1% SDS, 50mM NaP04 , and 0.6M NaCl. at 
42**C. Filters were subsequently washed at SS^C in O.IX SSC, 
0.1% SDS. 

5 Example 3; Expression of Gp2 genes in Pepper Plants 

To determine the distribution of Gp2 gene expression 
in pepper, RNAse protections and primer extensions were done 
on total RNA isolated from different pepper tissue. A 294 bp 
internal Hindi fragment from pGpl3 was purified on an agarose 

10 gel and cloned into the BS(+) vector (Stratagene San Diego, 

California, USA) to form the plasmid pGpl3-l. This clone was 
used in -the preparation of in vitro labeled riboprobe for 
RNAse protection experiments and screening of the genomic DNA 
library (see below) . A 20 base oligonucleotide shown below, 

15 beginning l9bp downstream of the translation start was 
prepared for the primer extension analysis. 

5, CTACTTTGGAAAAGCCAGCC 3' (Seq, ID No. 5) 
Primer extensions were done on 10 Mg of total RNA isolated 
from leaf, stem, petiole, root, seed, peduncle and flower and 

2 0 pepper fruit from different developmental stages according to 
the method described by Dunsmuir et al . , eds. , (1988) Plant 
Molecular Biology Manual, S. Gelvin and R, Schilperoovt , 
Kluwer Academic Dordrecht, pp, 1-17). RNAse protections which 
are at least ten times more sensitive than primer extensions, 

2 5 were done on the same samples essentially as described by 

Melton et al . , (1984) Nucl Acids Res 12:7035-7056. The data 
from these experiments are shown in Figure 3. Transcripts 
with homology to the Gp2 cDNA are absent from leaf, root, 
petiole, stem and flower, but are present at low level in seed 

3 0 and peduncle, and at very high levels in developing fruit. 

Example 4: Isolation of a gene from pepper plants encoding 

GP2 protein and containing the Gp2 promoter 
In order to isolate the genomic DNA region 
3 5 corresponding to the Gp2 gene, a genomic library was prepared 
from pepper DNA, and the Gp2 cDNA clone was used to screen the 
genomic library. The coding region from the cDNA was then 
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aligned against the genomic region in order to identify the 
promoter region of the gene. 

Pepper plants were grown in the greenhouse and 
nuclear DNA was isolated from young leaves as described in 
5 Dunsmuir, et a J . (1983) J. Mol . Appl . Genet., 2, 285-300, The 
DNA was partially digested with Sau3A followed by size 
fractionation on a glycerol gradient as described in Sambrook 
et ai., (1989), DNA in the 15-20 kb range was cloned into the 
BamHI site of the lambda DASH II cloning vector (Stratagene, 

10 San Diego, California, USA) followed by in vitro packaging 
with Gigapack extracts (Stratagene) and plating in coli 
strain LE392. Six hundred thousand recombinant phage clones 
were screened with a Gp2 (pGP2 3) riboprobe and three 
independent phage which hybridized with the probe were 

15 isolated, 

DNA prepared from the three genomic phage (#4, #12, 
#20) that hybridized to the Gp2 cDNA were digested with EcoRI , 
Hindlll, Spel, and Ncol and restriction maps were generated- 
The restriction maps are shown in Figure 4. The clones 
overlap and appear to be derived from a single genomic region. 
Hybridization of the Gp2 cDNA probe to the phage DMAs 
localized the Gp2 coding region as shown in Figure 4 by a 
boxed area. An expression cassette was prepared using the 6.1 
kb Spel fragment from phage #4. This fragment, which included 
the Gp2 coding sequence and approximately 2 . 5 kb of 5 • 
noncoding sequences and 2 . 0 kb of 3 • noncoding sequences was 
subcloned into the Spel site of BS(+) (Stratagene, San Diego, 
California, USA) to give plasmid PEP4-2. 

The plasmid PEP4-2 consists of 6,1 kb of pepper 
30 genomic DNA. A more detailed restriction map was generated 

with the enzymes Apal , Ncol, Sad, Bglll and PvuII (see Figure 
5). Precise localization of the Gp2 gene coding region within 
this fragment was achieved by sequence analysis of regions on 
the genomic clone which hybridized to the cDNA and alignment 
of these sequences with the known Gp2 cDNA sequence. There 
was complete agreement between the sequences of the genomic 
region and the cDNA clone except for the presence of an intron 
in the genomic DNA. The regions sequenced corresponded to 111 
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bp of the 5» end of the cDNA, including 47 bp of untranslated 
sequences and 54 bp of coding region, and approximately 3 00 bp 
of the 3' untranslated region. These data indicated that the 
genomic region cloned in phage #4 is the genomic equivalent of 
the Gp2 cDKAs and that the Gp2 transcripts present in fruit 
derive from this gene. Hence the DNA from PEP4-2 was used for 
the promoter isolation and preparation of an expression 
cassette. 

pxample 5; Preparation of an expression cassette 

containing a short form of the Gp2 promoter 

A short form of the Gp2 promoter was produced using 
a restriction fragment of the Gp2 promoter region obtained 
from the PEP4-2 clone. Subsequently, site directed 
mutagenesis was used in order to introduce a restriction site 
at the translation start codon for the Gp2 protein. 

From the PEP4-2 clone, described in Example 4, 
herein, the 2 . 1 kb EcoRI - Pvull fragment (see Figure 5), 
containing 1.2 kb of the 5' untranslated region and 900 bp of 
coding and intron sequences, was subcloned into the EcoRI-Smal 
site of BS(-) vector (Stratagene) to give rise to plasmid 
PEP4-6 (Figure 6). The 1.2 kb promoter region which is 
represented in this new clone called PEP4-6, corresponds to 
Gp2 (short) promoter. Site directed mutagenesis was used to 
introduce a restriction site at the position of the 
translation start to facilitate fusions to this promoter. The 
sequences surrounding the translation start of the Gp2 coding 
region were mutagenized according to the method of Kunkel 
(1985) Proc, Nat'l. Acad. Sci , 82: 488-492 in order to 
introduce a Ncol site. The oligonucleotide which was used for 
this purpose is shov/n below: 

5' GCCAGCC&IGGTAATATTTTTGG 3' (Seq. ID No. 6) 
Ncol 

The mutant clone PEP4-6mut was subsequently used as the source 
of a Gp2 (short) promoter fragment. 

The sequence of pepper genomic DNA in PEP4-6 clone 
was determined by the Sanger dideoxy method and is shown as 
Seq. ID No. 3. The promoter region of this sequence is shown 
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as Seq. ID No, 4. The sequence of the PEP4-6 clone is 
identical to the sequence of the cDNA clone in the regions 
which they have in common/ namely the transcription leader in 
the cDNA and the 5 ' end of the coding region up to where the 
intron interrupts the gene coding sequence (bases 1170 to 
1295) • The cDNA clone is identical to the sequence in the 
PEP4-2 clone throughout the entire coding region, and 
approximately 300 bp downstream of the stop codon. 

Example 6: Product ion of a vector containing a short form 

of the Gp2 pro moter operablv linked to the uigfA 
gene 

The short form of the Gp2 promoter produced in 
Example 5 was operably linked to the uidA gene, in order to 
examine the expression of DNA sequences linked to this form of 
the Gp2 promoter. The ^-glucuronidase (uidA) reporter gene 
(GUS) was fused to the Gp2 (short) promoter by digestion of 
the PEP4-6mut clone with Ncol and Hindlll and replacement of 
the Gp2 coding region with the Ncol - Hindlll fragment 
containing the uidA coding region and nos terminator from the 
plasmid PJJ3411 (see Jones et al . , (1992) Transgenic J^esearch 
1: 285-297). The newly created plasmid was called PEP4-6GUS. 

To insert the promoter/uidA fusion gene into the 
binary vector pJJ2964 (Jones et al . (1992) supra) the plasmid 
PEP4-6GUS was digested with BamHI and Hindlll and the DNA 
fragment containing the Gp2 (short) /uidA/nos 3* fusion was gel 
purified followed by ligation into the BamHI-Hindlll site of 
PJJ2964. The newly formed binary vector JJ2964/PEP4-6GUS was 
transferred by triparental mating with coli strain 
HB101/PRK2013 (Figurski and Helinski (1979) Proc Natl Acad Sci 
76:1648-1652) into the A. tumefaciens strain LBA4404 (Hoekema 
et ai., (1983) Nature 303: 179-180) which contains the 
virulence region of the Ti plasmid. 
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Example 7; Transfo rmation of plants with a vector 

conta irking a shor t form of the Gd2 promoter 
operablv linked to the uidA aene 

5 U Preparation of an Aar obacterium strain carrying 

the PJJ2964/PEP4-6 GUS nlasmid 
Fresh cultures of A. tumefaclens LBA 4404 
harboring plasnid pAL4404 were grown at 28"C for 24 h from a 
single colony in minimal A. E. coli HBlOl containing the 

10 plasmid pRK2013 and E. coli HBlOl containing pJJ2964/PEP4-6GUS 
prepared as described in Example 6 were grown for 6 hours in L 
broth. 0.5 ml of the A. tumefaciens culture were mixed on the 
same LB agar plate with 0.25 ml of HB101/pRK2013 and 0.25 ml 
of HBlOl containing pJJ2964/PEP4-6GUS and the plates were 

15 incubated for 24 hours at 28 'C. A loopful of bacteria from 
each plate was then resuspended in 1 ml minimal A and plated 
at 10°, 10*2 and lO"* dilutions on LB plates containing lOO 
Mg/ml rifampicin and 1 ng/ml tetracycline.. After several days 
growth at 28 "C, individual colonies were restreaked onto 

2 0 minimal A plates containing lug/ml tetracycline. Subsequently 
the Agrobacterium strain which carried the pJJ2964/PEP4-6GUS 
plasmid was used in the transformation of tobacco and tomato. 

2J Transfo rmation of tobacco plants 

2 5 All manipulations described below were carried out 

under sterile conditions. A culture of the Agrobacterium 
stain containing the requisite binary plasmid was grown for 24 
hours in minimal A medium at 28 'C. Leaf discs were punched 
from the leaves of SRI Nicotiana tabacum plants grown under 
30 sterile conditions (Maliga et al . (1973) Nature New Biol, 

244:29-30) using a cork borer with internal diameter 0.5 cm. 
The Agrobacterium cultures were diluted in MS/B5 0.1 to a 
final concentration of 5 x loVml- The leaf discs were dipped 
into the Agrobacterium suspension for 1 second each and then 
placed on co-cultivation plates, with 10 discs per plate. Co- 
cultivation plates were solid MS/B5 0.1/1.0 overlaid with a 
single, sterile Whatman #1, 7 cm filter disc. The leaf discs 
were co-cultivated with the AgrroJbacteriujn for 2 days at 28 "C. 
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The co-cultivation was terminated by washing the leaf discs in 
MS/B5 0.1/1.0 containing 500 ^g/ml cefotaxime for 6 hours with 
one change of medium. The discs were then placed on solid 
MS/B5 0-1/1.0 containing 500 ^g/ml cefotaxime and incubated 
for 3 days at 28 'C. The discs were then transferred to 
selective medium: MS/B5 O.l/l.o containing 500 ng/ml 
cefotaxime and lOO ng/ml kanamycin. The discs were cultured 
on this medium under high light flux with a 16 hour light, 8 
hour dark regime at 2 6«C. After 2-3 weeks, kanamycin- 
resistant callus and subsequently kanamycin-resistant shoots 
begin to appear. When these were sufficiently large to handle 
with sterile forceps, they were transferred to rooting medium 
(MS 0.1/1.0 with no naphthalene acetic acid or 6- 
benzylaminopurine, containing 2 urn indole butyric acid) 
without selection, and allowed to form roots for 14 days. 
They were then excised and retransf erred to rooting medium 
containing 100 tig/ml kanamycin. Plants that successfully 
developed roots on this medium were all transf ormants. They 
were then transferred to magenta boxes containing solid MS 
0.1/1.0 without naphthalene acetic acid or 6-ben2yl 
aminopyrine . 



2J Transfo rmation of tomato plants 

All manipulations were carried out under sterile 
conditions. A culture of Agrobacterium LBA4404 containing the 
PJJ2964/PEP4-6GUS plasmid was grown for 24 hours in minimal A 
medium at 28 -C. Explants were excised from the midsections of 
7-8 day old cotyledons of sterilely grown L. esculentum 
seedlings on germination medium. The Agrobacterium culture 
was diluted in liquid 2% glucose-Oms to a final concentration 
of 5X lo^/ml. 

The explants were submerged in the Agrrobacteriua 
suspension for 20-30 minutes and then placed on cocultivation 
plates for 2 days at 2 5 0C. The cocultivation was terminated 
by washing the leaf discs in liquid 2% glucose-Oms for 2 
hours. The explants were then placed on solid regeneration 
media and cultured under high light fluence with an 8 hour 
dark period at 25»C, After 10 days kanamycin resistant callus 
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appeared and then small shoot buds by 3 w eks. At 5 weeks the 
healthy calli and shoots were transferred to fresh selection 
regeneration medium and within 2 weeks many transformed shoots 
emerged. The shoots were then excised and transferred to 
5 selective rooting medium. Plants that successfully developed 
shoots in this medium in 6-10 days were all transf ormants. 
These plants were then transferred to non-selective rooting 
medium for 2 weeks to check for residual Agrobacter ium^ then 
transplanted to soil. 

10 

Example 8: Measurement of uidA gene expression in 

transformed plants 
An advantage of using the /3-glucuronidase marker 
gene is that it is feasible to measure either RNA levels or 

15 enzyme activity in order to monitor promoter activity. In 
this example, GUS RNA levels were measured by the RNAse 
protection method in tomato plants- In addition, enzyme 
activity was measured using the substrate 4 -methyl 
umbelliferyl glucuronide (MUG) in transformed tobacco plants. 

20 RNAse protection and/or primer extensions were done 

to quantitate the level of RNA in the different tissues of the 
transgenic plants. A riboprobe containing approximately 90 bp 
of uidA coding sequence beginning at the ATG was used in the 
RNAse protection assays. As described by Melton et ai • , 

25 (1984), supra, 10 /xg of total RNA isolated from transgenic 

tobacco leaf, stem and flower was hybridized to ^^P in vitro 
labeled RNA riboprobe. The hybrids were RNAse A/RNAse Tl 
treated and the digestion products (protected fragments) 
separated on an acrylamide gel. Six independent primary 

30 tobacco transf oirmants were analyzed by RNAse protection and 
found to have similar uidA miRNA levels present in leaf, stem 
and flower. See Table 1 below. The positive control used in 
these experiments was a tobacco transforroant carrying the 
CaMV35S:uidA construct, and the levels of mRNA expression 

35 measured in the leaf of this plant was 

For Tables 1, 2, 5A, 5B, 6, 8 below, the 
measurements are recorded on the following ascending 6-step 
scale (with approximately equal intervals between steps) : -, 
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+++, With the first step (dash) being not 

detected and the last step (quadruple plus) being highest 
level. Where a blank is shown, no measurement was made. 



t^^foL^H ^?""^«?«"^ of uldA mRNA levels in transgenic tobacco 
transformed with Gp2f abort ^ /uldA 



Trans formant 
Gp2,CUS-l 

-2 

-3 

-4 

-7 

-8 



Leaf 
+++ 



Stem 



Flower 



RNA levels were determined in RNAae protection experiments. 

Six independent tomato transf ormants were found to 
have similar uxdA mRNA levels in red (r) and green (g) fruit 
and leaf tissue (Table 2). Total RNA was isolated from a 
wider range of different tissues in three of the tomatb 
transformants and analyzed by primer extension analysis 
(Dunsmuir et ai . , 1987) using a 20 bp oligonucleotide 
positioned 13 bp downstream of the translation start in the 
uidA gene. The sequence of the primer which was used is shown 
below: 

5- GATTTCACGGGTTGGGGTTT 3" (Seq. ID No. 7) 
Using the primer extension assay uidA mRNA was detected in 
flower, petiole, stem, peduncle, calyx and seed (seed 
expression was seen in only one individual) . The expression 
level of the introduced Gp2 (short) /uidA gene was similar among 
the different tissues. See Table 2 below. The positive 
control plant used in these experiments was a tomato plant 
transformed with the CaMV3 5S:uidA construct, and the measured 
UidA RNA level in leaf was ++. 



TABLE 2; 



.r.nsfor.ea ^ttrSS^jLSSa"^^ '^^""'^ transgenic .o.ato 



Trano f 
Gp2-204 

- 210 

- 305 

- 308 

- 310 

- 321 



ZzX^ leaf flow, petl. etem 



ped. 



calv . 



seed root 



RNA levels were determined by primer extension. 
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GUS assays were done on protein extracts from three 
of the PEP4-6GUS tobacco transf ormants essentially as 
described in Richardson et al . , (1987) EMBO 6: 3903-3907. 
Extracts from leaf, pod, stem, seed, root and whole flower 
5 tissue were analyzed and GUS activity was detected in all 

tissues. The highest activity per mg total protein was found 
in the seed, followed by root and leaf, stem, flower and pod. 
The expression of GUS activity in the Gp2 (short) /uidA 
transf ormants was similar to the levels observed in a 
10 transf ormant with a CaMV3 5S/uidA gene which is considered to 
be constitutively expressed. (See Table 3 below) 



TABLE 3: Heaaurement of GUS activity in controle and transgenic tobacco 
plants transformed with the Gp2 ( short ) /uicfA gene, 
(pmoles 4-inethylumbellif erone/min/mg protein) 





Plant 




Flower 


Leaf 


Stem 


Seed 


Pod 


Root ^ 


20 


Untrans . 




1.1 


0.2 


0,3 


3.4 


0,57 


3.2 




Gp2s/ui<fA 


/a 

#2 
/3 


1.1 
2.2 
2.2 


763.7 
935. 9 
683.3 


60.0 
324.4 
507.7 


265.1 
3503.0 
3225, 7 


1.57 
1.68 
1.88 


935.2 
930.5 
911,6 


25 


35S/uidA 




6.7 


391.6 


102.7 


3329. 1 


1.53 


423.5 



When the Gp2 (short) promoter is fused to the uidA gene, the 
levels of GUS mRNA or GUS activity are similar between 
different tissues within the transgenic plant. The data 
presented in tables 2 and 3 indicate that the Gp2 (short) 
promoter used in these experiments directs a constitutive 
Pattern of expression in tomato and tobacco plants. 

Example 9; Transformation o f plants with a vector containing 
a short form of the Gp2 promoter ooerablv linked 
to the ALS gene 

Data have also been obtained regarding the 
expression of the acetolactate synthase gene (SuRB or ALS) 
after fusion to the Gp2 (short) promoter. The ALS gene which 
was used is a mutant form which confers resistance to the 
herbicide chlorsulf uron (see Lee et ai . , (1988) EMBO J. 
7;1241-1248) . WTT2132 (Figure 7) is a binary vector 
containing the following components: the tetracycline 
resistance (tetR) gene, fragments containing the replicon from 
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pACycl84 (Chang and Cohen (1978) J. Bacteriol 134: 1141-H56) 
and pVSI (Itoh et al . , (1984) Plasmid 11: 206-220). In 
addition there is the 35SCab22L promoter (Harpster et ai . , 
(1988), Molecular and Gen. Genetic, 212:182-190) fused to the 
tobacco surB gene (Lee et al . , (1988), supra), and the lacZa 
fragment of the B-galactosidase gene (Yanisch-Perron et 
al., (1985) Gene 33: 103-119) located between the left and 
right T-DNA borders from the A. tumefaciens octopine Ti- 
plasmid (van den Elzen et al., (1985) Plant Mol Biol 5: 149- 
154). Bglli linJcers were added at the BamHI site in the PEP4- 
6GUS plasmid such that the Gp2 promoter (short) could be 
isolated on a 1.2 )cb Bglll-Ncol fragment. This fragment was 
substituted for the Bglli-NcoI fragment which carries the 
35SCab22L promoter in WTT2132, forming the binary vector 
WTT2179 (Figure 7). This plasmid was subsequently transferred 
to A. tumefaciens LBA4404 (Hoekema et ai . 1983, supra) by 
triparental mating. 

•LL Transfo nnation of tomato plant-*; 

Tomato plants were transformed with A. tumefaciens 
containing the WTT2179 plasmid as described in Example 7, 
above . 

21 Transformati on of pepper plants 

The protocol for the transformation of pepper is 
as described in Example 3 of U. s. Patent 5,262,316, except 
that the explants were not washed after cocultivation, 
initiation medium was lOBIcscf and maturation medium was B4cs 
Shoots were rooted by transfer to Sorbarod plugs (Baumgartnen 
Papiers SA, Switzerland) soaked in liquid YRM (see below) . 
Shoots were confirmed for transformation using the recallusing 
assay on OMSG plus 0.5mg/l kinetin and 0.5mg/l 2,4-D, and by 
PGR using the oligonucleotides DEALS 1 and DEALS2 as described 
for pea below. 
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N6 salts and iron^^^ 1/2X 

Thiamine 1 mg/l 

5 Sucrose 3 0 g/1 

lAA* 2 mg/l 

pH 5.7 



*Add lAA after autoclaving 

See Chu et al . , (1975) ScientiA Sinicet 19:659-668 

3) Transformation frequencies for tomato and 

The relative transformation frecjuencies for both 
tomato and pepper transformation were compared for the 
Gp2 (short) /ALS selectable marker and the CaMV35S/ALS 
selectable marker* The transformation frequencies listed by 
species in Table 4 below. 



20 TABT.K 4: 

A. TOMATO 

Transformation frequency (#rooted tranef ormanta/# explants) and 
escape frequency (unrooted shoot #/total # shoots) comparison between 
25 Gp2 { short )/SurB or CaMV35S/SuRB selectable markers 

Binary Selectable Transformation Escape 

Vector Marker Frequency Frequency 

WTT2084 35S/AI-S 28/160 31/60 

30 WTT2132 35S/ALS 19/160 27/46 

WTT2179 Gp2(9)/ALS 21/160 15/36 



35 



40 



45 



B, PEPPER 

Transformation frequency (#rooted transf ormants/^ explants) and % 
explante shooting (# shoots/# cocultivated) comparison between 
Gp2{ short )/SurB or CaMV35S/SuRB selectable markers 



Binary 
Vector 

2104/CldlO 

WTT2132 

WTT2179 



Selectable 
Marker 

35S/ALS 
35S/ALS 
Gp2 ( 8 } /ALS 



explants 
shooting 

409/7900 
140/1480 
89/1880 



Transform. 

13/7900 (.38%) 
14/1480 (.95%) 
3/1880 (.37%) 



Example 10: Measurement of ALS aene expression iq 

transformed plants 
ALS gene expression was measured in the 
transformed tomato and pepper plants produced in Example 9. 
ALS RNA levels were measured in both leaf and fruit tissue. 
RNAse protection analysis (Melton et ai . , 1984, Supra) with a 
riboprobe containing 90 bp of the 5' coding sequence of the 
SurB gene (Lee et al , , 1988, supra) was used to quantitate the 
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level of ALS RNA in the transgenic tomato and pepper plants. 
7 tomato plants and 2 pepper plants transformed with WTT2179 
binary vectors were analyzed and found to have ALS RNA in both 
leaf and fruit tissue. These data illustrate that Gp2 (short) 
functions to direct expression of the SuRB gene in multiple 
tissues of transgenic tomato and pepper. These data are also 
consistent with that observed for the uidA gene fused to the 
Gp2 (short) promoter. See Table 5, below. The positive 
control plants used in these experiments were tomato or 
(pepper) plants transformed with CaMV3 5S:ALS, where the 
measured ALS mRNA level in leaf was 



TABLE 5A: Measurement of ALS RNA express ion in tomato plants 
transformed with Gp2 ( short ) -At-S : RNAse protection analysis 



Transformant # Young leaf 



Old leaf 



Red fruit 



AllOl 
A1102 
A1103 
A1104 
A1202 
A1203 
A1204 



+ 



c^?^Ko!Z. ^^^"^^s^^^n^ent of ALS RNA expression in pepper transformed with 
Gp2( short) -ALS or CaMV35S-ALS: RNAse protection with ALS probe 



Transformant / leaf 

lAcOla 
lAc02a 

lAc03a CaMV3SS -•-♦^.-^ 
lAc04a CaMV35S + 



green fruit 



red fruit 



The Gp2 (short) promoter has been used to drive the 
ALS gene as a selectable marker and found to perform like the 
CaMV3 5S promoter in this regard. RNAse protection has been 
used to directly compare the relative levels of ALS expression 
of the CaMV35S promoter and the Gp2 (short) promoter fusions in 
transgenic tomato and pepper tissue. The relative 
transformation frequencies achieved using the CaMV35S promoter 
and the Gp2 (short) promoter fused to ALS have been quantitated 
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as a measure of relativ strength of the two promoters • As 
shown in Table 5B, the strengths of the two promoters is 
similar « 

5 Example 11; Transformation of tomato Plants with a 

vector containing a short form of the Gp2 
promoter operablv linked to the AccS gene 
Tomato plants were also transformed with a vector 
containing a short form of the Gp2 promoter fused to an AccS 
10 gene fragment. The AccS gene fragment encodes ACC synthase. 

The plasmid 3 5SAccS contains the 35S Cab22L 
promoter (Harpster et al . , 1988, supra) fused to a fragment of 
the fruit specific ACC synthase gene, Acc2 (see Rottman et 
ai., (1991) cT. Mol, Biol. 222: 937-961), The PCR reaction 
15 with oligonucleotides located 160 bp downstream from the 

translation start and 210 bp upstream from the stop codon was 
used to generate a 5' and 3' truncated version of the gene. 
The PCR fragment was ligated, in frame, to the 35SCab22L 
promoter and the nos3 ' terminator. The plasmids 3 5SACCS and 
2 0 PEP4-6GUS were both digested with Ncol and Xbal then the 1.1 
kb AccS fragment from 35SACCS and the 4 • 3 kb fragment frbm 
PEP4-6GUS containing the BS(+) vector, the Gp2 (short) promoter 
and the nos3 • end were gel purified, ligated together and 
transformed into E. coli strain MV1193 to form PEP4-6AccS 
25 (Figure 8) , 

PEP4-6ACCS was digested with BamHI and Clal and 
the ends made flush with Klenow enzyme. The fragment was gel 
purified and ligated into the Smal site of both WTT2179 or 
WTT2132 (see above) to form WTT2179/PEP4-6AccS and 
30 WTT2132/PEP4-6ACCS. Orientation of the PEP4*6AccS fragment 
relative to the right border was determined by restriction 
digest. Subsequent transfer of the plasmid to A. tumefaaiens 
strain LBA4404 (Hoekema et al . , 1983, supra) was by 
triparental mating. 

Tomato plants were transformed with A, tumefaciens 
LBA4404 containing either the WTT2 179/PEP4-6 ACCS or WTT2132 
PEP4-6 ACCS plasmids as described in Example 7, above. 
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Eyqmple t?: Measurem ent of AccS aene eypr*.sgion in 

transformed tomato plants 
AccS RNA levels were measured in leaves of 13 
transgenic tomato plants produced in Example 11, by RNAse 
protection assays. A riboprobe protection fragment was 
prepared from 35SAcc2FL which consists of a 3 5SCab22L promoter 
(Harpster et al . , 1988, supra) fused to a full length PCR 
generated Acc2 (Rottman et al . , 1991, supra) coding sequence 
and the Jios3 • end. The Xhol - EcoRl fragment which carried 
the Cab22L and the first 285 base pairs of the Acc2 coding 
sequence was cloned into the Sall-EcoRI site of BS(+) 
(Stratagene San Diego, California, USA) and used as the probe 
in protection experiments. AccS transcripts were detected in 
leaf tissue in 7 of 11 of the WTT2179/PEP4-6AccS transf ormants 
analyzed and in 2 out of 2 WTT2132/PEP4-6AccS transf ormants 
(data not shown). This analysis indicated that the Gp2 (short) 
promoter directed expression of AccS in leaves of tomato. 

Example 13; Preparation of a v ector containing ;i 1 opg 

form of the Gp2 p romoter fused to the uirfa 
gene 

A long form of the Gp2 promoter was prepared by 
fusing an additional 1.6 kb from the Gp2 promoter to a short 
form of the Gp2 promoter. The plasmid PEP4-2 (Figure 9) was 
partially digested with Apal and the 5.6kb fragment containing 
the BS(+) vector and 2.6kb of the 5' untranslated region 
(promoter) upstream of the Apal site that is closest to the 
ATG, was then ligated to the Apal fragment from PEP4-6GUS 
which includes the 5* untranslated region between the Apal 
site and the Ncol site of Gp2 (short) promoter fused to the 
uidA coding region and the ;3os3 • . This newly formed plasmid, 
PEP4-2GUS (Figure 9) was digested with Spel, the ends filled 
in with Klenow, followed by digestion with Hindlll . The 4 . 6kb 
fragment carrying the Gp2(long) promoter fused to the uidA 
gene and nos3 ' end was gel purified and. cloned into the Hpal- 
Hindlli sites of JJ2964 (cited above) to create JJ2964/PEP4- 
2GUS. 
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Plasinid PEP4-2GUS was deposited according to the 
terms of the Budapest Treaty with the American Type Culture 
Collection (ATCC) of 12301 Parklawn Dr., Rockville, Maryland 
USA on June 15, 1994 and assigned Accession No.: 75816, If 
5 the plasmid PEP4-2GUS is digested with Ncol and PvuII, the 1.2 
kb fragment which is released can be used as a constitutive 
promoter. The 1.2 kb fragment corresponds to Sequence ID No. 
4. If the plasmid PEP4-2GUS is digested with Ncol and Spel, 
the 2.8 kb fragment which is released can be used as a tissue- 
10 specific promoter. 

A Gp2(long) promoter fused to the uidA gene and 
the Gp2 3' end was made by first introducing a Xbal site 17 bp 
downstream of the Gp2 stop codon by site directed mutagenesis 
(Kunkel, 1985) with a 18 bp oligonucleotide: 
15 5' GAAAG TCTAGA TGGATATAG 3'. (Seq. ID No. 8) 

Xbal 

The Xbal-Hindlll fragment from this clone, PEP4-2Xbamut , which 
carried the Gp2 3' end was cloned into the partially digested 
Xbal-Hindlll complete site of PEP4-2GUS to form PEP4-2GUS3'. 

20 To clone this fusion into the binary vector JJ2964, the 

plasmid PEP4-2GUS3 • was digested with Sad and the ends filled 
in using T4 Polymerase. The DNA was then digested with 
Hindlll. The fragment containing the Gp2(long) promoter fused 
to the uldA gene and Gp2 3' was ligated into JJ'2964 at the 

25 Hpal-Hindlll site forming the plasmid pJJ2964/PEP4-2GUS3 * 

which was transformed into E, coli strain JM83. Both binary 
vectors JJ2964/PEP4-2GUS and JJ2964 /PEP4-2GUS3 ' were 
transferred into A. tumeraciens LBA4404 (Hoekema, (1985, 
supra) by triparental mating. 

30 

Example 14: Transformation of tomato plants with a 

vector containing a long form of the Gp2 
promoter fused to the uidA gene 
Tomato plants were transformed with A. tum^fQciens 
35 LBA4404 containing either the JJ2964/PEP4-2 GUS or JJ2964/PEP- 
2GUS3 • plasmids, as described in Example 7, above. 
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gxainpj.e IS: Measure ment of uidA gene expression ir^ 

transformed plants 
uidA mRNA levels were measured in 15 independent 
tomato transf ormants which carry either the 
5 Gp2 (long) /UidA/ J50S3 • gene (9 plants) or the 

Gp2 (long) /uidA/Gp2-3 • gene (6 plants). In all but one of 
these plants the levels of uidA mRNA is significantly higher 
in red (r) fruit than in leaves indicating that the long 
version of the Gp2 promoter directs tissue specific expression 
10 compared with the short version which directs constitutive 

expression. See Table 6, below. The positive control plants 
used in these experiments were tomatoes transformed with the 
CaMV35S: uidA construct, where the uidA RNA level in leaf was 



15 



25 



35 



40 



TABLE 6: Measurement of uidA mRNA levels in transgenic tomato 

transformed with Gp2 ( long) /uidA 

Transformant Fruit-r Leaf 

20 4.2uidA-7201 {^) 

-7202 i^) 

-7205 

-7206 

-7207 ^ 
-7210 (♦) 
-7211 

-7218 (*) (♦) 

-7220 



30 4.2uidA, 3*-12101 

-12103 



-12106 
-12107 
-12111 

-12112 ++ 



GUS activity was also measured as described in 
Richardson et al , , (1987, supra). Tissue from leaf, stem, 
root, flower, seed, and green (g) and pink (p) fruit was 
assayed in 3 transgenic plants transformed with 2964/PEP4-2GUS 
and 3 with JJ2964/PEP4-6GUS3 • . Three additional JJ2964/PEP4- 
6GUS3' transf ormants were analyzed for activity in leaf and 
pink fruit only. Activity was detectable in all tissues in 
45 JJ2964/PEP4-2GUS transf ormants with the highest level in pink 
fruit followed by stem, root and green fruit, flower, seed and 
leaf. The JJ2964/PEP4-2GUS3 • transf ormants had the same 
profile except that root had higher activity than green fruit 
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and stem. In general, activity with the Gp2 3* end was lower 
than with the nos3 • end. See Table 7 below. 



10 



15 



20 



TABLE 7: Measurement of GUS activity in transgenic tomato plants 

transformed with the Gp2 ( long) /uidA gene (pmoles 4- 
methylumbellif erone/min/mg protein) . 



Trans. / 




stem 


root 


flower 


fruit-a 






-7202 


25 


2170 


1807 


176 


569 


3569 


44 


-7206 


17 


1406 


449 


16 


702 


3746 


36 


-7220 


101 


831 


310 


39 


1395 


5938 


20 


-12101 


21 


186 


1276 


5 


474 


4328 


73 


-12106 


6 


411 


1937 


7 


656 


2195 


73 


-12111 


1 


5 


241 


1 


48 


1491 


16 


-12108 


40 










1607 


-12110 


55 










3311 




-12113 


1 










238 




no uldA 


1 


1 


1 


1 


5 


7 


<1 


2SS/uldA 


537 


nd 


nd 


nd 


nd 


5388 


241 



The above data illustrate that the Gp2(long) 
promoter directed significantly higher levels of expression of 
the UldA gene in pink fruit than in other tissues. 
Furthermore, the Gp2(long) promoter directs a different 
pattern of expression from the CaMV35S promoter. 

Two different forms of a Gp2(long) promoter :uidA 
gene fusions have been compared. In one form, the nos 
terminator is used and in the other the Gp2 gene 3 • region is 
used. Significant differences between the performance of 
these constructions in transgenic tomatoes have not been seen* 
Both forms direct much higher levels of expression in fruit 
than in leaves. 



Example 16: Prepara tion of a vector containing a long 

form of the Gp2 promoter fused to the cell 
gene 

The pepper ceil gene was isolated by screening a 
red pepper pericarp cDNA library (prepared in the lambda gtlO 
vector) with two different degenerate oligonucleotides which 
had been designed to correspond to a conserved 5' and 3' 
region within the published coding sequences for the avocado 
mesocarp and the bean abscission layer cDNAs. The sequences 
of these oligonucleotides are given below: 
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Oligo il 5 ' TTA/GTCICCIGCA/GTCA/GTAA/GTAICCICC 3 • 
(Seq. ID No. 9) 

Oligo #2 5* TCCATA/GTCT/CTCIGGICGT/CTCCCAA/GCA 3* 
(Seq. ID No. 10) 

At least twenty independent cDNA clones were 
isolated using this screening method and each of these was 
characterized by restriction enzyme analysis and also direct 
sequence analysis. Based upon this characterization, all such 
cDNA clones were determined to have derived from the 
transcription of a single gene, cell. in order to prepare the 
cell coding region for reintroduction and expression in 
plants, the coding region was modified by the addition of 
restriction enzyme cleavage sites at the translation start and 
adjacent to the translation stop. These sites allowed for the 
simple excision of the coding region from the cDNA clone and 
the subsequent fusion of the cell coding region to appropriate 
transcriptional control regions. Specifically an Ncol site 
was introduced at the translation start using oligonucleotide 
site directed mutagenesis with the following oligonucleotide: 
Oligo #3 5' ATATATATAACCATGGCTTG 3' (Seq. ID No. 11) 

Ncol 

Then a Bgl ii site was introduced adjacent to the translation 
stop using the oligonucleotide: 

Oligo#4 5' AAATTGAAGATCTATAGTC 3 • (Seq. ID No. 12) 

Bglll 

The resultant plasmid was designated pCP35M (Figure lO) . AS* 
deletion of the cell coding region was isolated from the 
plasmid PCP3 5M by partial digestion with Ncol and complete 
digestion with Bglll. This deletion of 240 bases results in a 
coding region which specifies a protein which is missing 80 
amino acids at the NHj terminal. This fragment was then 
inserted into the plasmid PEP4-2GUS which had been digested 
with Ncol and BamHl. The resulting plasmid, pG2-ciAl0, was 
composed of a deleted form of the cell coding region fused to 
the Gp2(long) promoter at the 5' end, and the nos terminator 
at the 3' end (Figure lO) . 

The deleted cell coding region flanked by the 
Gp2(long) promoter and the nos terminator was isolated after 
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Bglll and Hindlll digestion of th plasmid pG2-ClAI0 followed 
by the generation of blunt termini with Klenow enzyme, then 
inserted into the binary vector pWTT213 2 which had been 
digested with Smal . The resulting plasmid WTT2 132/G2-C1a10 
carried the cell gene with the Gp2(long) promoter and nos 
terminator adjacent to the ALS gene selectable marker 
(CaMV35S/SuRB) for plant cell transformation and between the 
left and right borders of the T-DNA which directs insertion 
into the plant genome during transformation (Figure 10) , 



Ex^inpXe ^7: Transformation o f pepper plants wi-bh a 

vector containing a long form of the Gp7 
promoter fused to the^?ce"I^l^aene 
Pepper plants were transformed with A. tumefaciens 
15 containing the WTT2134 G2-C1 delta 10 plasmids, as described 
in Example 9, above. 

gxample ^8: Measurement of cell gene expression in 

transformed pepper plants 

20 Expression of the introduced cell gene was 

measured by direct quantitation of mRNA levels using the RNAse 
protection method, A probe fragment (which encompassed 3 30 
bases of the pepper ceil coding region) was prepared from 
PCP3 5-6L, which is a subclone from the pcell gene. This probe 

25 detected the introduced gene as a 260 base pair fragment and 
the endogenous gene as a 3 00 base fragment. Four independent 
transf ormants which express the cell transgene have been 
characterized (See Table 8 below) . 



TABLE 8: Measurement of cell mRNA levels in transgenic pepper plants 

transformed with Gp2 ( long ) -cei J measured by RNAse protection 

Transgene Endogenous 

Transformant # Leaf Fruit Leaf Fruit 

ICOla + . 

lC07a - . 

lC13a _ ^ 

lC20b ^ , 



In three of the transf ormants the level of the ceil mRNA is 
much higher in fruit tissue than in leaf tissue. This is in 
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contrast to the situation for CaMV35S/cell transf ormants . We 
evaluated CaMV35S/ceil transf ormants in 6 plants and found 
similar levels of cell RNA in both leaf and fruit tissue (data 
not shown) . 

Overall, the Examples demonstrate that the Gp2 
promoter is effective in both its short form (constitutive) 
and long form (tissue-specific) for controlling expression of 
a variety of genes (GUS, ALS, ACC synthase. Cell) in various 
plant species (tobacco, tomato, pepper) • 

The above examples are provided to illustrate the 
invention but not to limit its scope. Other variants of the 
invention will be readily apparent to one of ordinary skill in 
the art and are encompassed by the appended claims. All 
publications, patents, and patent applications cited herein 
are hereby incorporated by reference. 
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(i) APPLICANTS: BADEN, Catherine S,, DUNSMUIR, Pamela, 
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(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

<D) SOFTWARE: Patentin Release #1*0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US not yet assigned 

(B) FILING DATE: 

(C) CLASSIFICATION: 

<viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Dow, Karen B. 

(B) REGISTRATION NUMBER: 29,684 

(C) REFERENCE /DOCKET NUMBER: 12176-4 

(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: (415) 543-9600 
(B> TELEFAX: (415) 543-5043 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 594 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: cDNA 

( ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 46.. 270 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

GAAATTATTA TTTGAAAAAA AAAAAAAAGA TCCAAAAATA TTATT ATG GCT GGC 

Met Ala Gly 
1 



TTT TCC AAA GTA CTT GCA ACT ATT TTT CTT ATG ATG TTC CTG GTT TTT 
Phe Ser Lye Val Val Ala Thr lie Phe Leu Met Met Leu Leu Val Phe 
5 10 IS 
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OCT ACT GAT ATG ATG CCG GAG GCA AAG ATC TGC GAG GCG TTG ACG CGC ISO 
Ala Thr Asp Met Met Ala Glu Ala Lye lie Cys Clu Ala Leu Thr Civ 
20 25 30 35 

AAC TTC AAG GGG TTG TGC CTT AGT AGC CGC GAT TGT GGT AAT GTT TGC 198 
Aen Phe Lys Gly Leu Cys Leu Ser Ser Arg Asp Cys Gly Asn Val Cya 
40 45 50 

CGT AG A GAG GGA TTT ACC GAT GGC TCT TGC ATT GGA TTC CGT CTT CAA 246 
Arg Arg Glu Gly Phe Thr Asp Gly Ser Cya lie Gly Phe Arg Leu Gin 
55 60 65 

TGC TTC TGC ACG AAG CCC TGT GCT TAATTAACTC TTGAGAGGTG AAAGTCTCGA 300 
Cys Phe Cye Thr Lys Pro Cys Ala 
70 75 

TGGATAGATT GAAAAAACAT AAATTACTAT GAATTAATGA GTATTTTATA CTTTGTTGTT 360 

CTGCTTTTAT TTGTCATGAA ATAAAGACCA TTTCGATTAA TGCTTGCTAT GGAAAAAAAC 420 

CTGTTGTCAA CTTTTGATTG TAAGTTTTTT GTTTCCAAGG TTGTTATCTA AAGTATTGTA 480 

TCGTGTTGTT AGTTTAAATA TTTATGTTGA TTGTTAGTTC AATTTTATAT TGTATTGTGT 540 

CTGTAAATTA TATTATTATG TCACGATTAA AAGTCTTATT TTGATGTGAT GATC 594 

(2) INFORMATION FOR SEQ ID N02 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Ala Gly Phe Ser Lys Val Val Ala Thr lie Phe Leu Met Met Leu 
^ S 10 15 

Leu Val Phe Ala Thr Asp Met Met Ala Glu Ala Lys He Cya Clu Ala 
20 25 30 

Leu Thr Gly Asn Phe Lys Gly Leu Cys Leu Ser Ser Arg Asp Cys Gly 
35 40 45 

Aan Val Cys Arg Arg Glu Gly Phe Thr Asp Gly Ser Cys He Gly Phe 
50 55 60 

Arg Leu Gin Cys Phe Cys Thr Lys Pro Cys Ala 
65 70 75 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1727 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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( ix ) FEATtniE : 

(A) NAME/KEY: CAAT eignal 

(B) LOCATION: IIOOT.1103 

(ix) FEATURE: 

(A) NAME/KEY: TATA_Bignal 

(B) LOCATION: 1139.. 1146 

(ix) FEATURE: 

(A) NAME/KEY: miscf eature 

(B) LOCATION: 1169 

(D) OTHER INFORMATION: /note« -Transcriptional etart Bite" 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 1172 

(D) OTHER INFORMATION: /note- -pCPSO 5' «nd- 

(ix) FEATURE: 

(A) NAME/KEY: miac feature 

(B) LOCATION: 12347.1236 

(D) OTHER INFORMATION: /note= -Translation start codon" 

( ix ) FEATURE : 

(A) NAME/KEY: miecf eature 

(B) LOCATION: 1298 

(D) OTHER INFORMATION: /note- "Intron start site" 

(ix) FEATURE: 

(A) NAME/KEY: misc^f eature 

(B) LOCATION: 1..1233 

(D) OTHER INFORMATION: /note- -Gp2 promoter sequence- 

(ix) FEATURE: 

(A) NAME/KEY: miscfeature 

(B) LOCATION: 1,,1727 

(D) OTHER INFORMATION: /note- "Gp2 Genomic DNA clone" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



TTTTAACTGT 


GATTTTACTA 


TATCGTCCCT 


AATTAATCAT 


TATAGGACAT 


TTATGTATTA 


60 


AAAAGTCAAT 


AATATTTAAT 


TAAAAATAGA 


TTACTATAAT 


ACCCCCTATC 


ATTACTATAT 


120 


TACCCCTAAT 


TGCTCCGTGA 


TTTACTATAT 


TATCCCTAAT 


TAATTATTAT 


AAGCCATTAT 


180 


ATATTATAAT 


TAATAATATA 


TTTATTATAT 


TATCTCTAAT 


TAATTATTAT 


AACTCATTTA 


240 


TATATTAGAA 


TTAATAATAT 


TTAATTAAAA 


TAGATGTCAT 


CTCTGCCTAT 


ATTACCCCTA 


300 


ATTGCTCCAT 


AATTTACTGT 


ATTACCCCTA 


ATTAATTGTT 


ATAAGTCATT 


TATATATTAG 


360 


AATTACTAAT 


ATTTTTAATA 


CTATATTACC 


CCTAATTAAT 


TATTATAAGT 


CATTTGCTTT 


420 


CGATCGCTGC 


TTCATCATTT 


ACTATTTACC 


CCTAATTTCT 


CCATGATTTA 


CTCTATTACC 


480 


CCTTATTAAT 


TATTATAAGT 


CATTGATATA 


TTAGAATTAA 


TAATATATTT 


TTATTATATT 


540 


ACCCCTAATT 


AATTATTATA 


AGTCATTTAT 


ATATTAGAAT 


TAATAATATT 


TAATTAAAAA 


600 


AATACATATT 


TATGACGTTT 


GTTTCAGCTA 


CTTTAACCGT 


GATTTTACTA 


TATCATCCTT 


660 


AATTAATTAT 


TATAAGTCAT 


TTATTTATTA 


AAAAATTAAT 


AATATATAAT 


TAAAAAATAG 


720 


ATGACATGTC 


TGCCTATATT 


ACCCCTAATT 


AATTATTATA 


AGTCATTTAT 


TTATTAAAAA 


780 


ATTAATAATA 


TATAATTAAA 


AAATTGATGA 


GACATACTTG 


AACTTTAGGA 


GGGTCATATT 


840 


ACCCCTAGAT 


TAATTAAAAT 


CGTATTTTTG 


ACAACCTTAG 


TGCCTACGTG 


GACCTTCAGT 


900 
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CTCTTCATTT 


ACTCACGTAC 


CACTGTTGGG 


AGCACTACCT 


CCATAATGCG 


CCCTACAACG 


960 


CGCGATTCAG 


ATTAGTTGAC 


ACTCCAATAC 


GGACATCAAA 


AATAGATGGA 


AAACCGAAAA 


1020 


AAAACAAAAT 


AAAAATAATG 


TGTGTTTAAT 


TTTCATTTGT 


AGATTAGTCA 


AAGTTATACA 


1080 


CTCACAAATT 


CATGATAATC 


AACTTTTTCA 


TCATACATGG 


CTACGCTTTT 


TGAAAATCTA 


1140 


TATATATTTA 


GTAGTGGCGT 


CTTCATTCTT 


TACCCACATA 


CAAGAAA7AC 


AAATTATTAT 


1200 


TTGAAAAAAA 


AAAAAAGATC 


CAAAAATATT 


ATTATGGCTG 


GCTTTTCCAA 


AGTAGTTGCA 


1260 


ACTATTTTTC 


TTATGATGTT 


CCTGGTTTTT 


GCTACTGGTA 


ATTCATTTTC 


TTTTCTTTTT 


1320 


TTCCAAAGTA 


GTTCAACTAA 


TGGTATGACG 


ATATTTAATA 


CAACAACATA 


TCAAGTACTA 


1380 


AAATTTCACA 


AGTGAGGTCT 


GAAAAAGGTA 


AAGTGTATAC 


AGAATATATA 


TATAAAAACT 


1440 


ATTTCTTCAC 


TCGTTTGAAC 


TCTTATGTTA 


AATTTAAAGG 


CGGATTCTGC 


AAGTTCAACT 


1500 


AATGCTATGA 


CGCTATTTAA 


TACAACAATA 


ATATGTCAGT 


AGTAAATTTT 


CACGAGTGAG 


1560 


GTCTGAGAAA 


GGTAAAGTGT 


ACATACGTTT 


AATTTTACCA 


TTACTTCCGA 


GCAGTGGCGG 


1620 


AACCAGAAAT 


TTGATGAACA 


TAGTGTAAAT 


TCCAGTTGCT 


TCAAGGGTGT 


GCAAAATTAA 


1680 


ATATATACTC 


ATAATAGCCA 


ACATTTAACC 


TATATACACA 


ATACCAG 




1727 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1233 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

( ix ) FEATURE : 

(A) NAME /KEY: CAAT_signal 
<B) LOCATION: 1100. •IIOS 

(ix) FEATURE: 

(A) NAME /KEY: TATARS ignal 

(B) LOCATION: 1139,. 1146 

( ix ) FEATURE : 

(A) NAME/KEY: miBC_f eature 

(B) LOCATION: 1169 

(D) OTHER INFORMATION: /note« "Transcriptional start site" 

(ix) FEATURE: 

<A) NAME/KEY: raise feature 
(B) LOCATION: 1172" 

(D) OTHER INFORMATION: /note= "pGPSO 5" end- 

(ix) FEATURE: 

(A) NAME/KEY: raise feature 

(B) LOCATION: 1 . , 1?33 

(D) OTHER INFORMATION: /note- "Pepper Gp2 promoter sequence 
up to the ATG start codon." 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

TTTTAACTGT GATTTTACTA TATCGTCCCT AATTAATCAT TATAGGACAT TTATGTATTA 60 

AAAACTCAAT AATATTTAAT TAAAAATAGA TTACTATAAT ACCCCCTATC ATTACTATAT 120 
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TACCCCTAAT 


TGCTCCCTGA 


TTTACTATAT 


TATCCCTAAT 


TAATTATTAT 


AAGCCATTAT 


180 


ATATTATAAT 


TAATAATATA 


TTTATTATAT 


TATCTCTAAT 


TAATTATTAT 


AAGTCATTTA 


240 


TATATTAGAA 


TTAATAATAT 


TTAATTAAAA 


TAGATCTCAT 


GTCTGCCTAT 


ATTACCCCTA 


300 


ATTG CTCC AT 


AATTTACTGT 


ATTACCCCTA 


ATT AATTG TT 


ATAAGTCATT 


TATATATTAG 


360 


AATTACTAAT 


ATTTTTAATA 


CTATATTACC 


CCTAATTAAT 


TATTATAAGT 


CATTTCGTTT 


420 


CGATCGCTGC 


TTCATCATTT 


ACTATTTACC 


CCTAATTTCT 


CCATGATTTA 


CTCTATTACC 


480 


CCTTATTAAT 


TATTATAAGT 


CATTGATATA 


TTAGAATTAA 


TAATATATTT 


TTATTATATT 


540 


ACCCCTAATT 


AATTATTATA 


AGTCATTTAT 


ATATTAGAAT 


TAATAATATT 


TAATTAAAAA 


600 


AATAGATATT 


TATCACGTTT 


GTTTCAGCTA 


CTTTAACCGT 


GATTTTACTA 


TATCATCCTT 


660 


AATTAATTAT 


TATAAGTCAT 


TTATTTATTA 


AAAAATTAAT 


AATATATAAT 


TAAAAAATAG 


720 


ATCACATGTC 


TG CCT AT ATT 


ACCCCTAATT 


AATTATTATA 


AGTCATTTAT 


TTATTAAAAA 


780 


ATTAATAATA 


TATAATTAAA 


AAATTGATGA 


G ACAT ACT TG 


AACTTTAGGA 


CGCTCATATT 


840 


ACCCCTAGAT 


TAATTAAAAT 


CGTATTTTTC 


ACAACCTTAG 


TCCCTACCTC 


CACCTTCAGT 


900 


CTCTTGATTT 


ACTGACGTAC 


CACTGTTGGG 


AGCACTACCT 


CCATAATGGG 


CCCTACAACG 




CCCGATTCAG 


ATTAGTTGAG 


ACTCCAATAC 


GGACATCAAA 


AATAGATGGA 


AAACCGAAAA 


1020 


AAAACAAAAT 


AAAAATAATG 


TGTGTTTAAT 


TTTCATTTCT 


AGATTAGTCA 


AAGTTATACA 


1080 


CTCACAAATT 


CATGATAATC 


AACTTTTTCA 


TCATACATGG 


CTAGGCTTTT 


TGAAAATCTA 


1140 


TATATATTTA 


GTAGTGGCGT 


CTTCATTCTT 


TACCCACATA 


GAAGAAATAG 


AAATTATTAT 


1200 


TTGAAAAAAA 


AAAAAAGATC 


CAAAAATATT 


ATT 






1233 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA (primer) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
CTACTTTGGA AAAGCCAGCC 20 



<2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 23 base pairs 
<B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (primer) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 



GCCAGCCATG GTAATATTTT TGG 



23 
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(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 20 baee paira 
(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (primer) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

GATTTCACGG GTTGGGGTTT 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (primer) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
CAAAGTCTAG ATGGATATAG 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 
(A> LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (primer) 

( ix ) FEATURE : 

(A) NAME/KEY: modified base 

(B) LOCATION: 6 ~ 

(D) OTHER INFORMATION: /mod^base- i 

(ix) FEATURE: 

(A) NAME /KEY: modified base 

(B) LOCATION: 9 " 

(0) OTHER INFORMATION: /mod base- i 

(ix) FEATURE: 

(A) NAME/KEY: modified base 

(B) LOCATION: 21 ~ 

(D) OTHER INFORMATION: /mod_baBe-= i 

(ix) FEATURE: 

(A) NAME/KEY: modified base 

(B) LOCATION: 24 " 

(D) OTHER INFORMATION: /rood base^ i 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
TTRTCNCCNG CRTCRTARTA NCCNCC 
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(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nuclaic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA (primer) 



<ix) FEATURE: 

(A) NAME/KEY: laodif ied_base 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /mod_base« i 

( ix ) FEATURE : 

(A) NAME/KEY: modif ied_ba«e 

(B) LOCATION: IS 

<D) OTHER INFORMATION: /mod^base- i 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
TCCATRTCYT CNGGNCGYTC CCARCA 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 20 base pairs 
(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (primer) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

ATATATATAA CCATGGCTTG 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (primer) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



AAATTGAACA TCTATAGTC 



19 
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WHAT IS CLATTjFp Jfi- 

1. An isolated plant Gp2 promoter, wherein 
said promoter comprises a nucleic acid sequence identical to 
or with substantial sequence identity to the nucleic acid 
sequence depicted in Seq. ID No. 4. 

2. The isolated plant Gp2 promoter of claim l 
wherein said promoter is from a member of the family 
Solanaceae . 

3. The isolated plant Gp2 promoter of claim 2 
wherein said promoter is from a pepper plant. 

4. The isolated plant Gp2 promoter of claim i 
wherein said Gp2 promoter is constitutive. 

5. The isolated plant Gp2 promoter of claim 4 
wherein said promoter is from about 250 to about 1,233 
nucleotides in length. 

6. The isolated plant Gp2 promoter of claim 5 
wherein said promoter is about 1233 nucleotides in length. 

7. The isolated plant Gp2 promoter of claim 6 
Wherein said promoter has the nucleotide sequence depicted in 
Seq. ID No. 4. 

8. The isolated plant Gp2 promoter of claim 1 
wherein said promoter is tissue-specific. 

9. The isolated plant Gp2 promoter of claim 8 
wherein said promoter is from about 1,500 to about 3,500 
nucleotides in length. 

10. The isolated plant Gp2 promoter of claim 9 
wherein said promoter is about 2,700 nucleotides in length. 
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11. The isolated plant Gp2 promoter of claim 10 
wherein said promoter is the Gp2 promoter sequence contained 
in plasmid PEP4-2GUS, 

12 • A vector comprising a plant Gp2 promoter 

operably linked to a heterologous nucleic acid sequence, 
wherein said Gp2 promoter comprises a nucleic acid sequence 
identical to or with substantial sequence identity to the 
nucleic acid sequence depicted in Seq, ID No, 4, 

13. The vector of claim 12 wherein said Gp2 
promoter is from a member of the family Solanaceae. 

14. The vector of claim 13 wherein said Gp2 
15 promoter is from a pepper plant* 

15. The vector of claim 12 wherein said Gp2 
promoter is a constitutive promoter. 

16. The vector of claim 15 wherein said Gp2 
promoter has the nucleic acid sequence depicted in Seq. ID No. 
4 . 



25 



17. The vector of claim 12 wherein said Gp2 
promoter is tissue-specific. 

18. The vector of claim 17 wherein said Gp2 
promoter is the Gp2 promoter of plasmid PEP4-2GUS. 

19. An expression cassette comprising a plant 
Gp2 promoter operably linked to a heterologous nucleic acid 
sequence, wherein said Gp2 promoter comprises a nucleic acid 
sequence identical to or with substantial sequence identity to 
the nucleic acid sequence depicted in Seq. ID No. 4. 



35 



20. The expression cassette of claim 19 wherein 

said Gp2 promoter is from a member of the family SoletnacGae . 
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21. The expression cassette of claim 20 wherein 
said Gp2 promoter is from a pepper plant. 

22. The expression cassette of claim 19 wherein 
said Gp2 promoter is a constitutive promoter. 

23. The expression cassette of claim 22 wherein 
said Gp2 promoter has the nucleic acid sequence depicted in 
Seq. ID No. 4. 

24. The expression cassette of claim 19 wherein 
said Gp2 promoter is tissue-specific. 

25. The expression cassette of claim 24 wherein 
said Gp2 promoter is the Gp2 promoter contained in plasmid 
PEP4-2GUS. 

26. An isolated plant Gp2 gene or gene fragment 
comprising a plant Gp2 promoter operably linked to a nucleic 
acid sequence encoding a plant Gp2 protein wherein said plant 
Gp2 protein has an amino acid sequence identical to or with 
substantial sequence identity to the amino acid sequence 
depicted in Seq. ID No. 2. 

27. The isolated plant Gp2 gene or gene 
fragment of claim 26 wherein said plant Gp2 protein is from a 
member of the family Solanaceae . 

28. The isolated plant Gp2 gene or gene 
fragment of claim 27 wherein said plant Gp2 protein is from a 
pepper plant. 

29. The isolated plant Gp2 gene or gene 
fragment of claim 28 wherein said Gp2 protein has the amino 
acid sequence depicted in Seq. ID, No. 2. 
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30. The isolated plant Gp2 gene or gene 
fragment of claim 2 8 wherein said Gp2 gene or gene fragment 
has the nucleic acid sequence depicted in Seq. ID No. 3. 

31. A transgenic plant comprising a plant Gp2 
promoter operably linked to a heterologous nucleic acid 
sequence, wherein said Gp2 promoter comprises a nucleic acid 
sequence identical to or with substantial secjuence identity to 
the nucleic acid sequence depicted in Seq, ID No. 4. 

32. The transgenic plant of claim 31 wherein 
said heterologous nucleic acid sequence is expressed 
constitutively under the control of the Gp2 promoter. 

^5 33. The transgenic plant of claim 32 wherein 

said Gp2 promoter has the nucleic acid sequence depicted in 
Seq. ID No. 4. 



10 



20 



34 • The transgenic plant of claim 31 wherein 

said heterologous nucleic acid sequence is expressed 
selectively in fruit tissue under the control of the Gp2 
promoter. 

35. The transgenic plant of claim 34 wherein 
2 5 said Gp2 promoter is the Gp2 promoter of plasmid PEP4-2GUS. 

36. The transgenic plant of claim 31 wherein 
said plant is a member of the family SalanaceQe. 

37. The transgenic plant of claim 36 wherein 
said plant is a pepper plant. 

38. The transgenic plant of claim 36 wherein 
said plant is a tomato plant. 



35 



39. The transgenic plant of claim 36 wherein 

said plant is a tobacco plant. 
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40. A method of expressing a heterologous 

nucl ic acid sequence in a plant cell comprising: 

a) transforming said plant cell with a vector comprising 
a Gp2 promoter operably linked to the heterologous 
nucleic acid sequence, wherein said Gp2 promoter 
comprises a nucleic acid sequence identical to or with 
substantial sequence identity to the nucleic acid 
sequence depicted in Seq. ID No. 4; and 

b) growing said plant cell under conditions where the 
heterologous nucleic acid sequence is expressed in said 
plant cell. 

41. The method of claim 40 wherein said Gp2 

promoter is constitutive. 



42. The method of claim 41 wherein said Gp2 

promoter has the nucleotide sequence depicted in Seq. id No. 



4 . 



43. The method of claim 40 wherein said Gp2 
promoter is tissue-specific. 

44. The method of claim 43 wherein said Gp2 
promoter is the Gp2 promoter contained in plasmid PEP4.2GUS. 

45. The method of claim 4 0 wherein said plant 
cell is from a member of the family Soianaceae. 

46. The method of claim 45 wherein said plant 
cell is a pepper plant cell. 

47. The method of claim 45 wherein said plant 
cell is a tomato plant cell. 

48. The method of claim 45 wherein said plant 
cell is a tobacco plant cell. 
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